THE BELL SYSTEM 


echnical 


DEVOTED TO THE SCIENTIFIC 


AND ENGINEERING 





ASPECTS OF ELECTRICAL COMMUNICATION 








VOLUME XLIII MAY 1964 NUMBER 3 
Digital Light Deflection T.J. NELSON 821 
Perturbation Methods for Satellite Orbits F. T, GEYLING 847 
Methods of Orbit Refinement R. B. BLACKMAN 885 


On the Response of Nonlinear Control Systems to Periodic In- 
put Signals I. W. SANDBERG 911 


Digital Computer Simulation of a Four-Phase Data Transmis- 
sion System M. A. RAPPEPORT 927 


NEASIM: A General-Purpose Computer Simulation Program for 
Load-Loss Analysis of Multistage Central Office Switching 
Networks R. F. GRANTGES AND N. R. SINOWITZ 965 


Filling Factor and Isolator Performance of the Traveling-Wave 
Maser F. S. CHEN AND W. J. TABOR 1005 


The Comb-Type Slow-Wave Structure for TWM Applications 
F. S. CHEN 1035 


A Comparison of Permanent Electrical Connections G. w. MILLS 1067 


Evaluation of Solar Cells by Means of Spectral Analysis 
H. K. GUMMEL AND F. M. SMITS 1103 


A Theory of a Unilateral Parametric Amplifier Using Two Diodes 
J. HAMASAKI 1123 


ee 

Contributors to This Issue 1149 
B.S.T.J. Briefs: The Use of Wollaston Prisms for a High-Capacity 

Digital Light Deflector, w. 5. TABOR 1153 





COPYRIGHT 1964 AMERICAN TELEPHONE AND TELEGRAPH COMPANY 


THE BELL SYSTEM TECHNICAL JOURNAL 


ADVISORY BOARD 


P. A. GORMAN, President, Western Electric Company 
J. B. FISK, President, Bell Telephone Laboratories 
J. E. DINGMAN, Executive Vice President 


American Telephone and Telegraph Company 


EDITORIAL COMMITTEE 


A. C. DICKIESON, Chairman 

F, T, ANDREWS, JR. K, E. GOULD 

A. J. BUSCH Ww. C. HITTINGER 
R. P. CROSS J. R. PIERCE 

R. L. DIETZOLD D. H. PROCKNOW 
R. W. EHRLICH E. C. READ 


EDITORIAL STAFF 


. SCHINDLER, JR., Editor 
. COLE, JR., Assistant Editor 
. MYSAK, Production and Illustrations 


Hak © 
42485 2 


. POPE, Circulation Manager 


THE BELL SYSTEM TECHNICAL JOURNAL is published six times a year 
by the American Telephone and Telegraph Company, E. J. McNeely, President, 
C. E. Wampler, Vice President and Secretary, J. J. Scanlon, Vice President 
and Treasurer, 195 Broadway, New York, N. Y. 10007. Subscriptions are accepted 
at $5.00 per year. Single copies $1.25 each. Foreign postage is $1.08 per year or 
18 cents per copy. Printed in U.S.A. 


THE BELL SYSTEM 
TECHNICAL JOURNAL 


VOLUME XLIII May 1964 NUMBER 3 


Copyright 1964, American Telephone and Telegraph Company 


Digital Light Deflection 


By T. J. NELSON 
(Manuscript received July 30, 1963) 


A digital method of deflecting a light beam using n optical modulators 
and n untaxial crystals to provide 2” positions of the beam is described. The 
input-output relations for one special configuration are derived. Optical 
problems and limitations are investigated and, in particular, tt is found that 
the upper limit to the density of positions is about 10°/sq. in. Presently avail- 
able modulators are considered, and it is found that a KDP modulator has a 
power limitation above 1 mc operation for a total number of positions of 
about 70,000. Finally, the applications of the method as a semipermanent 
memory, a PCM decoder, and a digital delay line are briefly considered. 


I. INTRODUCTION 


It is desirable to substitute a light beam for the electron beam used in 
the class of devices of which the flying spot scanner and the Williams 
tube are examples. In this class of devices the electron beam is used to 
probe a suitable target and read or store information on it. The substi- 
tution is desirable because a light beam has negligible inertia and can, 
in principle, be deflected rapidly. The subject of this paper*: ft isa new 


* The contents of this paper were discussed by the author at the Twenty-First 
Annual Conference on Electron Device Research (IEEE), Salt Lake City, Utah, 
June 26-28, 1963. 

+ A brief description of methods that can be used to deflect a light beam is given 
by U. J. Schmidt (Schmidt, U. J., The Problem of Light Beam Deflection at High 
Frequencies, Proceedings of the Symposium on Optical Processing of Information, 
ed. Pollack, D. K., Koester, C. J., and Tippett, J. T., Spartan Books, Inc., Balti- 
more, 1968, p. 98). 

t Note Added in Proof. An article on some aspects of digital light deflection 
has recently appeared in the literature: see Kulcke, W., Harris, T. J., Kosanke, 
K., and Max, E., IBM Journal of Research and Development, 8, 1964, pp. 64-67. 
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method of deflecting a light beam. This method, which is called digital 
light deflection, employs n optical modulators and n uniaxial crystals. 
By applying appropriate two-level electrical inputs to the n optical 
modulators, it is possible to deflect a light beam to 2” positions. This 
deflection technique is inherently digital and the required optical modu- 
lator inputs are binary signals. 

In the present paper we shall consider: 

(a) the basic principles of digital light deflection 

(b) the address logic for a special configuration 

(c) optical problems which limit the density of positions that can be 
achieved 

(d) presently available optical modulators and the speed limitations 
which they impose on a digital light deflection system 

(e) experimental results that have been obtained with a four-unit (16- 
position) digital light deflection system. 


II. PRINCIPLES OF DIGITAL LIGHT DEFLECTION 


2.1 The Binary Unit 


It is well known that uniaxial crystals have the property of displacing 
light of one polarization, called the extraordinary ray, while the orthog- 
onal polarization, the ordinary ray, obeys Snell’s law. If the two rays 
are parallel upon entering the crystal, they will be parallel upon leaving, 
but are not parallel inside the anisotropic medium. 

Fig. 1 shows a uniaxial crystal oriented so that its optic axis lies in an 
vz plane. If the electric field vector of a plane-polarized beam is in the 


MODULATOR UNIAXIAL CRYSTAL 


ma 





OPTIC AXIS LIES IN 
Li x AN %Z PLANE 


Fig. 1 — The binary unit, 
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x direction, it will be displaced by an amount proportional to the thick- 
ness of the crystal. The exact dependence of the deflection on the crystal 
thickness and orientation is given in Appendix A. If the beam is polarized 
in the y direction, its electric field vector is normal to the optic axis, and 
the beam will pass through the crystal in a straight line. 

If a modulator precedes the uniaxial crystal and is capable of rotating 
the plane of polarization from the x direction to the y direction, and in- 
versely, under the influence of an input signal, then it is possible to 
switch the beam from one position to another. Furthermore, an error in 
the input signal will cause the beam to be split and light transmitted 
simultaneously to the two positions, rather than to some other position, 
as would be the case with an analog deflector. 

The combination of optical modulator and uniaxial crystal, in Fig. 1, 
is referred to as a binary unit. Because the use of a multiplicity of binary 
units will be considered next, subscripts identify the variables of a par- 
ticular binary unit. For example, ¢; represents the thickness of the uni- 
axial crystal in the zth binary unit and 2; describes the state of the 7th 
modulator. If x; = 1, then the plane of polarization of the incident light 
beam is rotated 90° by the zth modulator; whereas if x; = 0 the rotation 
is zero. 


2.2 The Deflection Bank 


If a linearly polarized light beam is made to traverse a sequence of n 
binary units of the type described, then a maximum of 2” positions of 
the beam can be realized. Such a combination of n binary units is desig- 
nated as a deflection bank, and if the deflections are all in the x direction, 
as the x-deflection bank. The resulting pattern of deflections produced 
by a bank depends upon the thickness and orientation of the various 
uniaxial crystals in the bank. 

A linear pattern of 2” positions, uniformly separated, can be obtained 
with an n-unit bank if all uniaxial crystals have the same orientation and 
if their thicknesses are respectively to , 2t0, --- , 2” to. If the beam dis- 
placement in the thinnest crystal is d), then the separation between 
positions is also d). Each binary unit in such a bank is unique, as its 
uniaxial crystal differs in thickness from all other uniaxial crystals. The 
binary units in such a linear deflection bank may be arranged in any 
order, but we shall confine any further discussion to a configuration where 
the light beam successively encounters uniaxial crystals of thickness 
given by 


t= 2°; t= 1,2,-+-,n. (1) 
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This configuration will be shown later to have interesting and desirable 
properties. 

A three-unit example of the configuration that has been singled out 
is shown in Fig. 2. The polarization of the beam before it enters the bank 
is orthogonal to the optic axes of the uniaxial crystals and the thickness 
of successive uniaxial crystals decreases by two. The two columns at the 
right of Fig. 2 give the positions of the beam in binary form and 
the modulator inputs which are required to deflect the beam to these 
positions. 


2.3 Address Logic 


In this section, a derivation of input-output relations and a discus- 
sion of code conversion are given for the special linear bank configu- 
ration which was mentioned in the previous section. 


2.3.1 Input-Output Relations 


For the purpose of deriving the deflection as a function of the modu- 
lator inputs, it is assumed that the polarization of the beam, as it enters 
the bank, is orthogonal to the optic axes of the uniaxial crystals in the 
bank. Then if none of the modulators are operated, the beam is unde- 
flected by the bank. Since the displacement of the beam by unit 7 is de- 
termined by the plane of polarization of the beam after it has passed the 
modulator in unit 7, whether deflection occurs is determined by the num- 
ber of times the polarization of the beam has been rotated up to, and 
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REPRESENTATION 
OF DEFLECTION 








y Ty Lary 
—+——000———— 000—— 
——001————001—— 
—-—010————o11-—— 
—~011————010—-— 
——100————110—-— 
——101—-——— tt 
——110———— 101 -— 
44 { SS a 4100S 


























INPUTS TO THE 
MODULATORS 





x, 


Fig. 2 — Three-unit deflection bank. 
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including, unit z. If we designate the deflection at unit 7 as a binary vari- 
able, d;, then the displacement of the beam is 2” ‘dy if d; = 1 and 0 
if d; = 0. The general formula for the deflection is 


d= Rodd Cigtey * 4 42y) (2) 


and 


2 


> ot d; do (3) 


t=1 


dz 


where d, is the total deflection of the beam in the zx-deflection bank. Souq 
is the notation for a symmetric switching function and its value is one 
if an odd number of the indicated variables have the value one, and zero 
otherwise.’ Thus unit 7 will add its displacement to the total if the polar- 
ization of the beam has been rotated an odd number of times up to, and 
including, the zth unit. 

The use of (2) and (3) is illustrated by considering the three-unit bank 
in Fig. 2. Suppose the input to the modulators is 


UyXLoN3 = 111 


and we wish to find the corresponding deflection. According to (2) we 
count, starting from the left, the number of ones appearing up to and 
including the position under consideration. If the result is odd we enter 
a one at the corresponding position of the deflection variable. Hence in 
this case 


dideds = 101. 


Substitution of these deflection variables into (3) yields that the total 
deflection d, = 5do. 

In general, the addressing or input-output relations have not been 
completely specified until the inverses of (2) and (3) are given. If we 
desire to have unit 7 add its contribution to the total displacement but 
not unit  — 1, or if we desire the displacement of unit 7 — 1 but not 
that of unit 7, then it is clear that the modulator in unit 7 must rotate 
the plane of polarization of the beam. This fact is the inverse of (2) and 
is expressed by 


Ci = oda(di-4 ; d;). (4) 


The inverse of (3) merely translates the binary number 21, 2, °+* , Xn 
from base two to base ten. 


0. Deu re (5) 
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The use of (4) is apparent if we again consider the three-unit bank in 
Tig. 2. Suppose a total deflection d, = 5d» is desired and we wish to find 
the necessary inputs; then (4) is appropriate. First, however, (3) is used 
to obtain the deflection variables d;. Use of (3) gives in this example, as 
we have seen before, did2d; = 101. According to (4), if an entry in the 
deflection variable differs from the preceding entry, the corresponding 
entry in the modulator variable x; is a one. Sinee the last two entries in 
dydyds; = 101 are 01, then x; = 1. Proceeding in this fashion we obtain 


UV = 111 


which is the expected result. 

An interesting and desirable property of the configuration that has 
been considered is that it is always possible to switch the beam to ad- 
jacent positions by changing the state of excitation of just one modulator. 
This is apparent for the three-unit bank in Fig. 2. To show this in gen- 
eral, consider the highest numbered unit, 7, for which the number of rota- 
tions of the plane of polarization up to and including unit j is even. 
Evidently, all the units following 7 add their contributions to the total 
displacement. Since 


OP Se OF Pa Oe cect OF te so! (6) 


we have only to change the state of excitation to modulator j to increase 
the total deflection by the incremental distance, d). Also, consider the 
highest numbered unit, k, for which the number of rotations up to and 
including unit k is odd. Then unit & adds its contribution to the total 
deflection, but none of the units following k do. Hence, by changing the 
state of excitation to the modulator in unit k, we decrease the deflection 
by dy. The reasoning breaks down in the first case if the excitation is 
100 --- 0, where the number of rotations is odd for all units, and in the 
second case if the excitation is 00 --- 0. However, the total deflection is 
then either a maximum or zero, and these two conditions are reached 
from each other by changing the state of excitation to the modulator in 
unit 1. Hence the input-output relations for this configuration are cyclic; 
Le., adjacent positions can always be reached from each other by chang- 
ing the state of excitation to exactly one modulator. 


2.3.2 Code Conversion 


One can ask, what type of operation can be performed on the input 
signals to further simplify the input-output relations? If the inputs to a 
switching network are designated by the W,’s and the outputs by the 
x,’8, which are, as before, the direct inputs to the modulators, one type of 
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code conversion could be 


rt; = Scaa(Wi-a, Wi). — 
It is known from (4) that 
t: = Soaa(di1 , di); (8) 
hence, from an inspection of (7) and (8) it is evident that 
d; = W;. 


Thus, the deflection and input variables are caused to be the same binary 
numbers. If the W,’s are the outputs of a series of bistable multivibrators 
which are triggered from a serics of harmonically related sine waves, then 
Wi,We.,--+,W, and hence d;,d2,--- ,d, assume the binary numbers 
in increasing order, and a linear sweep would thereby be effected. 

Fig. 3 shows the details of one switching network which can be used for 


xy 





Fig. 3 — Code conversion network, 
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such code conversion. Two AND gates and one OR gate per binary unit 
are required, with the exception of the input to unit 1, which requires 
no alteration. 


III. OPTICAL CONSIDERATIONS OF DIGITAL LIGHT DEFLECTION 


3.1 Incorporation of Lens System 


Up to this point, consideration has been limited to plane waves; how- 
ever, a higher density of resolvable positions is achieved with focused 
light. If an 2x-deflection bank and a y-deflection bank are incorporated 
into the object and image spaces of a lens as shown in Fig. 4, it is possible 
to focus an aperture into the image plane in a rectangular array of posi- 
tions. The extra modulator which precedes the y-deflection bank in Fig. 
4 insures that the input polarization to the y bank is in the x direction. 
To accomplish this, the input to the modulator, 7, must then be given 
by 


a = Daven Ai 3 225 eae Si) 


where Seven 1S a Symmetric function whose value is one if an even number 
of the indicated x variables have the value one, and zero otherwise. Thus, 
the deflection in the y direction, dz, is related to the y variables in the 
same manner as the deflection in the x direction, dz, is related to the x 
variables. Hence the input-output relations developed in the previous 
section apply to both the x bank and the y bank. 

If there are n units in the x bank and m units in the y bank, then the 
resulting pattern is a matrix of 2” X 2” positions. The incorporation of a 


Y 
naa Zz IMAGE 
SOURCE POLARIZER - LENS PLANE 
x 











g 
g 
j 
APERTURE Jf fief Fo FJ Jone 
Ly Zo Ih T Us Ye Um . 


INPUT TO RECTIFYING MODULATOR, T=Seven (243 Lo,--52n) 


Fig. 4 — Incorporation of x bank and y bank into lens system. 
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lens into the system causes the rays through the system to have a range 
of angles with the z axis. Since the deflection in the uniaxial medium is 
angle-dependent, the maximum angle must be kept small. The optical 
modulators also limit the range of angles that are tolerable. . 

For simplicity in the following discussions, the distance from the aper- 
ture to the lens is assumed equal to the distance from the lens to the 
image plane. That is to say, the optical system has unity magnification. 
This simplifies some of our considerations with little loss in generality, 
and also unity magnification probably would be used in any practical 
system. 


3.2 Diffraction Effects 


Because of the anisotropy of the uniaxial crystals and possibly the 
modulator crystals, we require a high f number for the system. However 
the minimum resolvable separation between spots is roughly proportional 
to the f number. If the ratio of the focal length of the lens to the diame- 
ter of the lens opening is 15, that is, f/15, then the maximum angle any 
ray through the system can have with the axis of symmetry is 3/7 
degrees. The following calculations are based on this compromise. 

We define the crosstalk ratio to be 


C = 10 logi [P(0)/P(d)] (9) 


where P(Q) is the integrated intensity falling on a circle of radius a, the 
radius of the aperture, centered on the image and P(d) is the inte- 
grated intensity falling in a circle of radius a separated from the image 
by a center-to-center distance of d. Fig. 5 shows curves of constant cross- 


0.8 


9° 
to) 
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RADIUS OF APERTURE,d,IN MILS 
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3.0 
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Fig. 5 — Radius of aperture vs center-to-center spacing at constant crosstalk, 
f/15 and 6948 A. 
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talk ratio plotted with a as ordinate and d/2a as abscissa. [ig. 6 is a plot 
of the loss from the integrated intensity admitted by the aperture to that 
falling on a circle of radius a centered on the image. These data were 
obtained by numerically evaluating the integrals derived in Appendix B. 
Vigs. 5 and 6 are given for f/15 and = 6943 A. Plane wave illumination 
of the aperture was assumed, and small angle approximations were made. 

From the standpoint of crosstalk, no optimum seems to exist, although 
we find that for a = 0.3 mil and d = 1.1 mils, the value C = 20 db re- 
sults. Therefore we find that the spot density, D, can be made as high as 


1 
~ (11 x 10-3)? 


at f/15, and 6943 A. Fig. 6 indicates that the loss would be 11 db for this 
case. 


D = 0.826 < 10°/in” 


3.3 Refractive Index Effects 


In general, a deflected beam passes through some of the uniaxial crys- 
tals as an extraordinary beam; hence it encounters a different refractive 
index from the undeflected beam which passes through all crystals as an 
ordinary beam. As a result the deflected image of the aperture will be 
formed at a different z coordinate in the optical system than the unde- 
flected image. For paraxial imaging, which is consistent with our assump- 
tion of a fairly large f number, a calculation yields that this shift in the 
image is given by 


ne Nn 


where Za is the z coordinate of the deflected image and Zp) is the z co- 








LOSS IN DECIBELS 








ie) 0.2 0.4 0.6 0.8 1.0 
4 IN MILS 


Fig. 6 — Loss vs radius of aperture at f/15 and 6943 A. 
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ordinate of the undeflected image. Here z is positive in the image space 
and is measured from the center of the lens. Also, dz and dy are the a 
and y displacements of the deflected image. In (11) 


2 2\ 3 
ae (tm) . (12) 
Ne => Qnone /(no + Ne) (13) 


and for the orientation y of the uniaxial crystal which gives the largest 
deflection (see Appendix A) 


cot PW = 2nmo/ (mo — ne). (14) 
The refractive indices of calcite at 589 my are” 
ny = 1.658038, Ne = A864 (15) 
hence from (11), (12), (13), (14) and (15) 
AZ = 0.36 dz + 0.36 dy. (16) 


We should note that since this is indeed the equation of a plane, we 
have merely to tilt any target to be used at the appropriate angle to get 
a sharp image at each of the possible positions of the spot. 


IV. OPTICAL MODULATORS 


Two possible ways of achieving 90° rotation of the plane of polariza- 
tion have been investigated. A high specific rotation is possible in yttrium 
iron garnet in a magnetic field due to the Faraday effect. The necessary 
fields are quite high, however, for 90° rotation in a crystal of reasonable 
thickness; moreover, YIG has high optical attenuation at the wave- 
lengths for which the specific rotation is greatest.3 

Due to the linear Pockels effect, a ZO plate of IKH2PO. can cause 90° 
rotation with the half-wave voltage applied in the z direction if the in- 
duced principal axes are at 45° to the x and y directions.* Since KDP is 
a ferroelectric, the half-wave voltage is directly proportional to the tem- 
perature above the Curie temperature. 

Fig. 7 defines the dimensions and orientation of a KDP crystal to be 
considered for a modulator. 

The clamped dielectric constant, loss tangent, and half-wave voltage 
in the z direction are,” 


_, 2.27 X 10° + 4.77 


T — 119 i) 


€ 
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-1 
tan 6 & 58 (18) 
Vype & 42(T — 119) volts. (19) 

The stored energy and the dissipated energy per cycle are 
Ws = 3CV i 2 (20) 
~~ {1.77 + 3.67 X 10°7} (7 — 119) X 10° "(ab/c) joules (21) 
Wais = 27Ws tan 6 (22%) 
~~ {0.938 + 1.94 X 10°°7T} X 107°(ab/c) joules (23) 


where the dimensions a, b, and c are the dimensions of the crystal in the 
x, y, and z directions, respectively, and are understood to be in centi- 
meters. 


INDUCED 
PRINCIPAL 





Fig. 7 — Dimensions of KH:PO, modulator. 


In the section on diffraction, we noticed that the center-to-center 
spacing of the spots would have to be greater than about one mil. If 
there are eight binary units in the z-deflection bank and eight in the y 
bank, then a square array of 65,536 positions results, 256 on a side. If 
the incremental spacing is one mil, then with some loss in intensity to the 
spots on the edge of the pattern we could choose the diameter of the 
lens opening, and hence a and b, to be 1 cm. 

For this relatively small number of positions, and for a KDP modulator 
as thick as 1 cm, the dissipated energy per cycle will be on the order of 
one microjoule. To reduce the stored energy per cycle to this value, we 
would have to cool the modulator to about 6° above its Curie tempera- 
ture. Thus a KDP modulator has a severe power limitation for nominal 
system capacities, if operation above | mc is desired. 


V. EXPERIMENTAL RESULTS 


In order to examine the optical limitations of digital light deflection, a 
four-unit system has been constructed. Fig. 8 isa pictorial diagram and 
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Fig. 8 — Pictorial diagram of four-unit deflection system. 


Fig. 9 is a photograph of the experiment. The essential elements are: a 
50-mil point light source which illuminates a 1.0-mil diameter aperture, 
a 65-mm lens which focuses the aperture at unity magnification into the 
image plane, a four-unit deflection bank, and a microscope for viewing 
the real image of the spot in the image plane. The four-unit deflection 
bank consists of four rotators and four calcite disks. The rotators were 
constructed by aligning two disks of Bausch and Lomb quarter-wave 
plastic. The rotators were mounted so that they could be rotated through 
45°. In one orientation the principal directions of the so-constructed half- 
wave plate were parallel to the two possible polarizations of the beam 
and there was no rotation. In the other orientation the principal direc- 





Fig. 9 — Photograph of four-unit experiment. 
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tions of the half-wave plate were at 45° to the possible polarizations of 
the beam, and the polarization of the beam was rotated by 90°. 

The system was designed to give 16 positions of the 1.0-mil spot, with 
center-to-center distances of 2.5 mils. This design called for calcite disks 
of 0.0266, 0.0452, 0.0903, and 0.1806 in. thick. The disks obtained were 
0.020, 0.045, 0.090, and 0.180 in. thick, the largest error occurring in the 
thinnest disk. 

It was found that, for the maximum deflection of nominally 37.5 mils, 
the dispersion of the calcite in the deflection was about 1.0 mil over the 
visible spectrum. The half-wave plates were found to pass appreciable 
amounts of the unwanted polarization at the red end of the spectrum, 
and the dichroic polarizer used was known to be less efficient at the blue. 
For these reasons, the cleanest spot pattern was obtained with a green 
filter. It was further found that the relative intensity of the diffraction 
rings increased appreciably above an f number of 22. Figs. 10 and 11 are 
photographs of the spot pattern for various modulator settings with 
white and green light, respectively. The nomenclature 2x72%374 has 
the same meaning as that described above. These photographs were taken 
through the microscope at f/22 and f/16 of the lens and camera respec- 
tively, and the exposure times are indicated. The photographs show that 
the error in the thinnest calcite disk causes a noticeable “pairing” of the 
spots. . 

In order to provide a check on the diffraction analysis, we set the lens 
opening to f/15 and used a 666-my filter. The spot pattern was enlarged 
six times, and a 6-mil diameter aperture was centered at the position of 
the first spot in the new image plane. The light transmitted through this 
aperture was then detected by a photomultiplier. With this arrangement, 
x4 was alternated, and the resulting change in photomultiplier current 
noted. These first two spots are ‘‘paired,” as may be easily seen by in- 
specting Figs. 10 and 11. We measured their center-to-center spacing in 
the red light to be 1.04 + 0.25 mils. For f/15, \ = 694 u, andd = 1.04 
mils, the predicted value of the crosstalk ratio is 12.8 db, whereas the 
measured value was 10.2 db. 


VI. APPLICATIONS 


There are many functions that could be performed by digital light de- 
flection. We shall briefly discuss three: a semipermanent memory, a pulse 
code modulation (PCM) decoder, and a digital delay line. In these appli- 
cations, deflection banks similar to those described above can be used. 
The principal differences reside in the numbers of possible positions of 
the beam and the type of target used. 
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X1,Xo,X%3, Xyq IN 
INTERMEDIATE SETTINGS 
20 SECONDS 





X,, Xo, %3,X4 = 0000 X,, Xo,X%3, X4 = 1010 
5 SECONDS 5 SECONDS 





X,,X2,X3,X4 = O10! X15 X21X3q X4 = 1000 
5 SECONDS § SECONDS 


Fig. 10 —- Microphotographs of spot patterns with no filter. 


6.1 The Semipermanent Memory 


Digital light deflection may be used to provide access to a target where 
information is stored in the form of a matrix of potential light paths. 
We place a photomultiplier tube, or some other photosensitive device, 
behind the target. When one of the possible combinations of inputs is 
applied to the modulators, the beam falls on the position of the target 
corresponding to that combination. If the target is transparent at that 
position, then we get an output from the photosensitive device which 
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Fig. 11 — Microphotographs of spot patterns with green filter. 


might represent a stored “‘one.’’ If the target is opaque, the photomulti- 
plier does not respond and a stored ‘‘zero”’ is inferred. A simple target 
could be a card with holes punched through it at some of the positions 
in the matrix. Since we could substitute cards with different information 
content, this is a semipermanent memory. In a memory application, we 
desire a very high storage capacity, and hence a matrix consisting of very 
many positions of the beam. We have shown in our section on diffraction 
that a high density of positions is possible; however, it then becomes difhi- 
cult to position the target accurately, and this is an inherent problem 
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with the device. This application has the virtue that raw binary numbers 
serve as inputs to the storage element, and these are the type of signals 
most immediately available in computing machines. 


6.2 The PCM Decoder 


In the application of digital light deflection to the decoding of PCM 
signals, the target consists of an array of 2” light paths, where n is the 
number of bits in a code group. The transparency of the light paths is 
quantized so that the amplitude of the output signal is different for each 
position. The target could be a positive photographic plate partially 
exposed at each of the accessible positions. The deflection banks could be 
conveniently used to expose the plate. The output of the photosensitive 
element would then be quantized PAM, and we recover the original 
analog signal by the usual method of passing the PAM signal through a 
low-pass filter. The target positioning problem is alleviated in this appli- 
cation because the capacity of the system is low. 


6.3 The Digital Delay Line 


Fig. 12 shows an 2-deflection bank, quarter-wave plate, polarizer, 
delay line, analyzer, and photosensitive element arranged in line. The 
«-deflection bank enables us to displace the light beam in a digital man- 
ner along the x axis. Since the output light from the deflection bank is 
plane polarized, and the plane of polarization alternates from one posi- 
tion to the next, we use a quarter-wave plate and a polarizer to fix the 
plane of polarization that falls on the delay line. At time zero we generate 










Y 
aa | 
TRANSPARENT 
Zz ULTRASONIC PHOTOSENSITIVE 
DELAY LINE ELEMENT 
X DEFLECTION BANK m4 
aie I ( 
I 
if 
‘WV ABSORBER 
Xx, Lz Xm QUARTER _” } = ANALYZER 
WAVE PLATE (~) 


POLARIZER~~ 


Fig. 12 — Application of digital light deflection to a digital delay line. 
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an ultrasonic wave in the delay line. In the absence of such a disturbance, 
the delay medium is transparent and isotropic, but as the disturbance 
passes through the position of the light beam the strain birefringence 
causes the light beam to be elliptically polarized. The analyzer is crossed 
with the polarizer and hence only transmits light to the photosensitive 
element when the ultrasonic wave passes through the position of the 
beam. Then we get an output from the photosensitive element. Thus the 
time at which an output results is proportional to the displacement. of 
the beam, which in turn is determined by the state of the input variables. 
Thus digital light deflection can be used to make an electrically variable 
digital delay line. We should note that for a maximum delay line length 
of one inch the range of the device would be approximately 10 usec, since 
sound travels in solids at roughly 0.1 in. per usec. The time resolution 
would depend on the bandwidth of the delay line and the shape of the 
amplitude over the cross section of the light beam. 


VII. CONCLUSIONS 


A simple method of deflecting a light beam in a digital manner has been 
demonstrated. The method has the virtue of requiring only n two-level 
inputs for a set of 2” possible outputs, and thus the inputs to the deflect- 
ing mechanisms are binary signals. Deflection rates above about 1 me 
and with reasonable power requirements will depend on the development 
of optical modulator materials substantially more efficient than KDP. 
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APPENDIX A 


Fig. 13 shows the angles and directions in the uniaxial crystal with 
which we shall be concerned. The faces of the crystal are normal to the 
z axis; the optic axis of the crystal lies in an xz plane and at an angle 6 
with the z axis. The incident light is assumed to be a plane wave propagat- 
ing in the z direction. In general the beam will be split into the ordinary 
ray, which passes through the crystal in a straight line, and the extra- 
ordinary ray, which is displaced. 

Fig. 14 is a diagram showing the geometrical configuration of the wave 
normal, s, and the optic axis, c. We have chosen to solve the more general 
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Fig. 18 — Angles and directions in uniaxial crystal. 





Fig. 14— Wave normal and the erystallographic direction in the uniaxial 
media. 


problem since the added complexity introduced is not great. In the uni- 
axial medium, we have’ 


Va = Vi = Vo = c/n (24) 
and 
Ve.= Ve = c/n. (25) 
Making use of Fresnel’s equation of wave normals, 
Sa Sp Se 


——* et + = 0, (26) 


V,-Vée Vez —Ve' Ve— Ve 

where V, is the normalized phase velocity, we find the two solutions 
Vp = Vo (27) 
Vp = Vo cos’ @-+ V¢' sin’ 8. (28) 
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Born and Wolf’ give the relations between the components of the ray 
vector and the components of the wave normal 


2 
_ &k 2 g 
t= (V+ prea) (29) 





where 
g = Vr (V, —_ Vp) 
1 (30) 


Sa Sp } Sc ) ; 
(siya) + (xt) + ae 


Substituting in V, = Vo we find that 


g =0 (31) 
Ve = V; = Vo (32) 
th = Se. (33) 


Thus, for the ordinary ray, the ray vector and wave normal coincide. 
If we substitute our second solution for the phase velocity into (29) and 
(30) we find 

> = (V2 — Vy’)’ sin’ 6 cos’ 6 (34) 
_ Ve sin’ 6 + Vo’ cos’ 0 

V2 sin? 6 + V,? cos? 6 
by ke Bk | Ye t¥e — V;.") sin’ @ + Vo(Vo — Vi) cos” | (36) 
eV Vy V2 sin’? 6+ Vi? cos? @ — Vi? 


a 
I 


(35) 


This is as far as we care to carry the general case. For our more special 
consideration, let the 6 axis of Fig. 14 coincide with the y axis of Fig. 13; 
of course, the c axis is the optic axis, and the wave normal § points along 
the z axis. Thus § lies in the ac plane so that 


Sa = sin 8, Sy = 0, Se = Cos 6. (37) 


If ¢ is the angle between the ray vector and the c direction, then by use 
of (36), we find 


tang = (V//V_) tan 6 = (n/n) tan 8. (38) 
Then the tangent of the angle between the ray vector and the z axis will 


be 


tan g — tan 6 


tany =t SY) Pee A ee 
BW an he Or eaten 


(39) 
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2 
( — 1) tan 6 
n 
3 eet (40) 


No” 
1 + ne tan? 6 


Also, (35) becomes 


1 has 1 . ? (41) 
ne sin 6+ Pe: cos 6 
e 0 


hence 


2 
1+ a tan’ 6 
ne = ——“2___, (42) 
a + at tan? 6 
Ne Ne! 
where n, is the refractive index for the extraordinary ray. By setting the 
first derivative of tan y with respect to tan 6 equal to zero, we may find 
the orientation which gives the greatest displacement of the extraor- 
dinary ray. This results in 
Ne 


a (43) 


tan 0 


2 2 
No ro Ne 
2NNo 


_ 2nerne 
m= MW et ne ao) 


If ¢ is the dimension of the uniaxial crystal in the z direction, then the 
displacement, d, of the extraordinary ray will be 


2 2 
i (Ho) (46) 
2neno 
if the crystal has the optimum orientation. 
We may note that in the negative uniaxial crystal, where mo > n., 
the extraordinary ray propagates away from the optic axis, whereas the 
situation is reversed in the positive crystal. 


tan y (44) 


APPENDIX B 


The light incident on the aperture, of radius a, is assumed to be a plane 
wave with wave number k = 27/) and amplitude A. The distance S 
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from an clement of area in the aperture to a point, P, on a sphere of 
radius /2? is expanded in a Taylor series expansion about the radial dis- 
tance, p, from the center of the aperture (and sphere) to the clement of 
area in the aperture. 


S = R— psiné@cos®::-. (47) 


Here @ is the angle between the radius vector to the element of area on 
the sphere and the cylindrical axis, Z; and ® is the angle between the 
plane containing R and the cylindrical axis, and the radius vector to the 
clement of area in the aperture. Fig. 15 shows these elements in their 
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Fig. 15 — Angles and dimensions used to determine E(@). 


proper relation to one another. If the incident light has an electric field 
vector of the form 


E = Ae *” (48) 


then the electric field intensity on the sphere as a function of @ is, for 
small 6 


E(8) = a [ A sa — psin 0 cos &)] 


— psin 6 cos ® pean ae ey) 


Since p sin 6 cos ® «< RF, only the phase contribution is significant. 
ae —ikk Qn 
LO) = | [ p exp (ikp sin 6 cos ®) dp d®. (50) 


Equation (50) is a standard integral with the value 
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£(8). = (51) 


R ka sin 6 
Since the case of unity magnification is to be considered, it is assumed 
that the only significant function of the lens would be to cause the image 
and object spaces to be mirror images of each other if there were no stop 
at the lens. Thus, we have a section of a spherical wave to consider with 
a known phase and amplitude distribution. Fig. 16 shows the angles and 
distances under consideration. The angle @ retains its original meaning 
and ® is once again a dummy variable; p now is the distance from the 


ka’ Ae **” (7 (ka sin *) 


_- SPHERE OF RADIUS R,WITH 
w77 CENTER AT O' IN IMAGE PLANE 





Z AXIS 
Fig. 16 — Angles and dimensions used to determine E(p). 


center of the image to the point in the image under consideration. Here 
S is the distance from an element of area on the sphere to the point, P, 
in the image plane, and is expanded about p. 


S= R-—psiné@cos@+-:--., (52) 
Hence the intensity at p is 


k i‘ ** (6) exp[—ik(R — psin@ cos &)| 
0 


E = — - 
(o) Qa 0 R — psin#@ cos ® 


R sinédod®, (53) 





The approximation is once again made that p sin 6 cos® < R: 





kRe*® 8m 2a 
| | E(@) exp (kp sin 6 cos ®) sin 6d6d® (54) 
Tv 0 0 


E(p) = 5 


and, putting in the previously determined expression for £'(6), 
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6 Qa : 
a i J1(ka sin 6) 
E = k 24 12kR i 1 
(p) = (ka)"Ae fe eas ee ies 


-exp (ikp sin 6 cos ®) dé d® 


and integrating first over ®, 


On 
E(p) = (ka) Ae” / Ji(ka sin @)Jo(kp sin 6) dé. (56) 
0 


The integral in (56) is not available analytically; hence we resort to 
numerical procedures. The f number of the system and 6,, are intimately 
related, 


1 
tan On = iF (57) 


and for small angles, 


1 
Om FZ TF (58) 
Since we wish to evaluate the crosstalk ratio, it is also necessary to 
integrate [#(p)]’ over a circle of radius a located in the image plane. 
Tig. 17 defines the center-to-center separation as d and the local cylin- 
drical coordinates p’ and y. We find by the usual geometrical considera- 
tions that 


p = [d’ — 2dp’ cosg + p”). (59) 


However, no approximations are in order here, as p’ and d are of the same 
order of magnitude. Therefore the integral 


2Qr a 
P(d) = | [ p E*({d’ — 2dp' cos g + p”I') dp’ de (60) 
0 “0 


results in the power falling on the circle of radius a and being displaced 
from the center of the image by distance d. The crosstalk ratio, C, will 





Fig. 17 — Circle of integration for determining crosstalk ratio and loss. 
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be defined as 


_ P(0) 
Also, we are in a position to evaluate the loss through the system 
na A 
L= 10 logio (HA) db, (62) 


that is, the db ratio of the power admitted to the system by the aperture 
to the power falling on a circle of radius a centered on the image. The 
amount of light lost from that emitted by the source depends strongly on 
the type of source and focusing at the aperture. If the source is focused 
on the lens system in the absence of the aperture, then our equations are 
still good approximations, although such illumination would not be 
plane. 
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Perturbation Methods for Satellite Orbits 


By F. T. GEYLING 


(Manuscript received October 14, 1963) 


The literature in astrodynamics abounds with perturbation techniques 
for satellite orbits. Various formulations have been generated in terms of 
orbit elements, the satellite position and velocity vectors, or combinations 
thereof. The computational effectiveness of any perturbation scheme depends 
largely on the definitions used for the dynamic state variables. Some meth- 
ods are aimed at long-range predictions and orbit lifetime studies, others 
at short-range predictions for guidance. This paper may serve as an intro- 
duction to this field for the nonspecialist, in that it reviews the classical 
variation-of-parameters technique and discusses several engineering analy- 
ses that were generated in the post-Sputnik era. It also points to some con- 
nections between these relatively simple approaches and more elaborate 
methods of celestial mechanics. Thus it may contribute toward a comparison 
of several ‘“professtonal’’ approaches whose relative merits are often de- 
bated among experts. 


I. INTRODUCTION 


This paper is a discussion of various perturbation techniques for 
satellite orbits which were investigated by the author and his colleagues 
during the past few years. The effort began with a tutorial ‘orbit 
seminar” several years ago and it seemed appropriate to collect some 
of this material here as a companion paper for R. B. Blackman’s ‘‘Meth- 
ods of Orbit Refinement.”’ 

It is a symptom of our times that aerospace engineers are taking a 
new look at the established methods of dynamical astronomy. The 
orbital geometries and vehicle characteristics encountered with arti- 
ficial celestial bodies often require departures from the formulations of 
classical astronomy and, in fact, have stimulated several new (or at 
least independent) approaches during the post-Sputnik era. The number 
of publications in this time has been formidable, and in many discus- 
sions the names attached to various formulations serve as passwords 
for the ideas they represent. The uninitiated find themselves at a loss 
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concerning the methods that stand behind these names, their degree of 
originality, and their relations with each other. 

In view of this situation the following article is addressed to two kinds 
of readers: 

(t) The newcomers in the field of orbital mechanics who seek a tutorial 
survey and an introduction to some of the literature. A bare minimum 
of definitions is given for their benefit; a discussion of basic order-of- 
magnitude relations and certain intuitive notions which would strengthen 
the beginner’s grasp of the physical problem had to be omitted for lack 
of space but can be found in the literature.” 

(iz) The specialists in orbital mechanics who have not had occasion 
to correlate some of the better-known contributions in the literature 
and who may find this work a step in that direction. Typical issues 
in such comparisons are the choice of coordinates, the accuracy and 
elegance achieved by various transformations of the variables, and the 
precision obtainable from series expansions of the solution in terms of 
various small parameters. 

The simultaneous need for conciseness of presentation and discussion 
of certain analytic detail presents somewhat of a dilemma. As a com- 
promise, much of the development between the explicitly quoted re- 
sults is covered in a descriptive way and the reader is referred to the 
literature for all standard derivations.1*+ Most of our discussion con- 
cerns orbits of moderate eccentricity, which are representative of satel- 
lite missions. However, in many places an extension to the highly ec- 
centric orbits of space probes follows readily. 

We begin by devoting Section II to a statement of the fundamental 
equations of motion, the definition of so-called orbit parameters, and a 
description of various disturbing functions. Section IJI summarizes 
the classical treatment of satellite perturbations as gradual changes 
of the orbit parameters. (‘rom a general point of view, this formulation, 
due to Lagrange, is derivable from the canonical systems governing the 
satellite problem.) It is hoped that this covers a sufficient amount of 
standard material to introduce the concepts and the parlance of orbital 
mechanics. 

In Section IV we examine several perturbation methods for aero- 
space applications which are based on variously defined spherical and 
moving Cartesian coordinates. This includes the well-known contri- 
butions by Bhtzer et al., Anthony et al., and Roberson. They could 
serve as an introduction to the discussion of more elaborate formula- 
tions by King-Hele et al. and Brenner et al. In Section V we treat one 
more formulation in this general category which was specifically de- 
signed for guidance studies. 
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A logical continuation of this paper would cover the methods of 
Breakwell et al. and Diliberto, Kyner’s averaging technique, and the 
one suggested by Struble. Ultimately the hierarchy of perturbation 
methods leads to the Hamilton-Jacobi techniques expounded by Brou- 
wer, Garfinkel, and Vinti. These represent a very popular approach to 
higher-order perturbations and the coupling between simultaneous 
disturbances of satellite orbits. 


II. PRELIMINARIES AND DEFINITIONS 


We remember that the underlying phenomenon of undisturbed satel- 
lite motion (in a central force field, i.e. around a spherically symmetric 
body) is Newton’s law of inverse square attraction. In a Cartesian co- 
ordinate system this spells out to be 


@= — (Gmz/r’) (1) 
gj = — (Gmy/r’) (2) 
2 = — (Gm.z/r’) (3) 


where G is the universal gravitational constant, m. the central mass, 
and we shall usually take Gm, = k for brevity. m, shall be the mass of 
the earth in all our discussions. [Strictly speaking, formulas (1) to (3) 
should show the sum of m, and the satellite mass instead of just m, .| 
r is the distance from the origin, and dots indicate time derivatives. 
The x-y plane is usually taken to coincide with the equator, while the 
positive x axis points to the vernal equinox. The above equations 
simply state that each acceleration component is due to the correspond- 
ing component of the gravitational attraction — the minus signs indi- 
cating a direction toward the origin. The solutions of (1).to (3) are 
the well-known Kepler orbits — ellipses, parabolas, and hyperbolas. 

Such orbits can be conveniently described by a set of six parameters 
that give the plane of the motion, the shape of the orbit and its orien- 
tation in that plane, and the timing of the satellite motion along this 
path. These quantities may be considered the constants of integration 
for a solution of (1) to (3). A standard set of such orbit elements for 
elliptic motion is illustrated in Fig. 1. They are: 


a, the semimajor axis, 

e, the eccentricity, 

w, the argument of perigee, (4) 
1, the inclination, 

Q, the nodal angle, and 

7, the time of perigee passage. 
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Fig. 1 — Standard set of orbit elements for elliptic motion. 


The last quantity establishes a time scale for the entire motion in that 
it serves as “epoch” and fixes one particular passage through perigee. 
If the satellite has swept out the angle f since that passage, the elapsed 
time is given by 


ee aa Pd He Vee ct 
t—7r = (a'/k) {2 tan (=) tan | 


. f 
= 1-2) sin f 
@ ( e) 1 + e cos fJo 
where ¢ is the time pertaining to the position 0’. f is known as the true 
anomaly and (5) holds for all values of this angle. If we set f = 27 
this corresponds of course to a full revolution around the orbit, and the 
clapsed time interval is 





(5) 


T = 2x (a°/k)’, (6) 


which is known as the anomalistic period. The instantaneous position 
0’ can also be defined in terms of other angles, the so-called eccentric 
anomaly / or the mean anomaly M, which will be defined later. They 
can be related to time in similar ways. 

The parameters in (4) represent a typical set of orbit elements. The 
position and velocity vectors at some epoch are an alternative suggested 
by (1)-(8). Most astrodynamical theories use variations and combina- 
tions of all these, but from the general standpoint of analytical dynamics 
most sets of six parameters (if they are independent of each other) may 
be regarded as sets of canonic variables. Before we proceed to detailed 
formulations we examine briefly the various physical disturbances which 
cause these parameters to change in time. 


PERTURBATION METHODS 851 


2.1 The Effect of Extraterrestrial Gravitation 


If we consider the attractions from masses other than the carth we 
speak of “extraterrestrial” gravitation. In the presence of a disturbing 
body P, (1) becomes 








= —- _ — Gm, @ aa + 2) (7) 
where 
M, = the mass of P, 
rip = [(@ — ty)’ + Y — Yo)” + (2 — 2%), 


the distance from the satellite to P, and 
Epi [ep F Us oT 2515 


the geocentric distance of P. 

The corresponding equations for 7 and 2 are obvious. Now it is often 
convenient in analytical dynamics to express the disturbing terms in 
#, j, 2. as partial derivatives (OR/dx), (@R/dy), (OR/dz) of a disturbing 
function R. For the present case we would have 


r= Gity [2 __ tXp + YYe + “| (8) 


Tip es 





as can be easily verified. 

We see that the ratio of the second term in (7) to the first is of the 
order m,r°/mzr,, = x. Typical values for x in the planetary system are 
equal to or less than 10”. Its smallness is vital to the entire rationale of 
a perturbation technique. 


2.2 The Effect of the Earth’s Oblateness 


The potential field for the nonspherical earth can be represented to 
various levels of accuracy by a series of spherical harmonics. If we 
restrict ourselves to terms with rotational symmetry about the polar 
axis, we obtain the following disturbing function 


2 
i Gm I 





mr 3 (1 = S sin’ ¢) 
(9) 


+ 2 (sing = 5 sin’ v) + | 


where 
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R = the earth’s equatorial radius, 

¢g = the geocentric latitude of the satellite, 
J = 1.6239 x 10°, and 

H = 6.04 X 10°. 


I 


This two-term series is sufficiently accurate for our purposes. 


2.3 The Effect of Atmospheric Drag 


The resistance encountered by a satellite from the atmosphere is a 
subject of considerable uncertainty and continued research. For one 
thing, the density of atmospheric gases as a function of geographic 
location, altitude and time is not well known; moreover, the laws of 
interaction between a satellite and this rarefied medium are incompletely 
understood. Doubts exist as to the transition from a continuum be- 
havior of the atmosphere to the gas-kinetic regime and the extent to 
which electric interactions play a role. Nevertheless, the classical drag 
law yields useful results in many cases and we shall concentrate on it. 


We let 


Fp = —(CpA/2) pv." (10) 
where 
Fy = the total drag force on the satellite 
A = the frontal area of the satellite 
Cy = the drag coefficient 


p = the atmospheric density 
Va = the satellite velocity relative to the atmosphere. 


The monotonic decay of p with altitude covers approximately ten 
orders of magnitude within typical satellite altitudes and remains the 
subject of extensive study. The relative velocity v, is simply the difference 
between the satellite’s inertial velocity vector v(“,y,z) and the rotational 
velocity of the atmosphere V = ro cos yg, where o can usually be taken 
as the earth’s angular motion (the diurnal rate) and V always points due 
east. One is frequently justified in employing an approximate vector rep- 
resentation of (10): 


Fp & (CpA/2) po(V — v). (11) 


For typical earth satellites this force is at least two orders of magnitude 
smaller than the central attraction, ie, « < 10™°. 
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2.4 The liffect of Radiation Pressure 


As the reader knows, solar illumination exerts some pressure on 
every satellite. The magnitude of this force depends on the reflectivity 
and geometry of the satellite and, strictly speaking, on the distance 
from the satellite to the sun. It frequently suffices to represent this dis- 
turbance as a constant force 8 per unit mass and to note that it is many 
orders of magnitude smaller than the central gravity force. 


III. PERTURBATIONS IN THE ELEMENTS 


The six orbit elements [see (4)] were constants for the case of central 
inverse-square attraction. However, if any additional forces act on the 
satellite these parameters will be subject to change. To emphasize their 
time dependence we might write them as a(t), e(é) etc. In fact, their 
numerical values at any time ¢ describe the ellipse the orbiting body 
would follow if all perturbations vanished as of that instant. This 
trajectory is obviously tangent to the actual flight path at ¢ and is 
known as the “osculating” orbit. The relation between the satellite 
position in the osculating orbit and in the 2, y, 2 frame follows from the 
geometry of conic section trajectories: 


2 
L= a [1 cos f + lp sin f] 
2 
y = oe [nu cos f + i sin f] (12) 
2 
z= oO ms c05 f + nz sin f] 
where 1; , 12, +++ me are functions of 27, w, and 2. Hence x, y, z are repre- 


sentable as functions of a, e, 7, w, 2, f. [As mentioned with (5), we 
could also work in terms of the independent variable / or M instead 
of f.] However, the complete definition of the osculating orbit also entails 
that 


Ox : df 
of (a,e,t,0,Q, f) dt 


oy : df 
of (a,¢,2,0,Q,f) dt (13) 


dz ‘ df 
of (a,¢,2,,0, f) dt 


ce, 
I 
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where a, e, 2, w, Q, 7 are treated as constants. In other words, the ve- 
locity as well as the position in the osculating orbit are representative 
of the actual motion. This is the full extent of the ‘condition of oscula- 
tion.”’ 

A large part of classical celestial mechanics has been based on the 
concept of osculating orbits, and during the post-Sputnik era Lagrange’s 
classical treatment of the perturbations in these elements has been 
exploited ad ultimo. Its inclusion in this article is justified mainly by the 
need for completeness in an introductory survey such as this. It also 
serves as a point of reference for the nonclassical ‘“‘perturbations in the 
coordinates” in the next section and for the Hamilton-Jacobi techniques 
frequently used by astronomers. 

In essence, the Lagrange method consists of transforming the basic 
equations 








sed Ginx ake 
rs Ox 
Gmy , OR 
sees pts! Oly 4 
y 3 cy (14) 
..  Gmz , aR 
as r = Oz 


to six first-order equations in the orbit elements and approximating 
their solutions by quadratures. Remembering that these parameters 
represented the constants of integration for the Ikepler problem, (1)- 
(3), we note that the transition to a(t), e(t), --+ is nothing but La- 
grange’s “variation of constants” designed to accommodate the terms 
[aR/d(x,y,z)] in (14). In the process of transforming (14) by means of 
(12) we avoid the occurrence of second derivatives of the orbit elements 
by demanding that 


dl | ae 
Ox Ox Ox Ot Ox Ox .» Ox . 
Le i Sle RM Ae a cal 15 
sat oh Sak aa Oe Oe (15) 


etc. This of course results in the reduction of the system of three second- 
order equations (14) to six first-order equations. Equations (15) are 
simply a restatement of (13), the condition of osculation. 

In principle, (14) could be transformed by a straightforward substi- 
tution of (12). In order to simplify the algebra, however, one may 
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symmetrize these equations to a form where all the labor reduces to the 
evaluation of such quantities as 


(2 ot Ox a) (2! ay day 2) 
0a;da; da; dai 0a: da; da; dai 


Oz 02 dz dz 
+( ray ) = fas, ah 


Jai OQ; Ox; Oa: 





(16) 


with 7,7 = 1--- 6. 
The shorthand symbol that we have adopted for this expression is 
known as a Lagrange bracket, and a; and a; stand for any two of the 
orbit elements. These brackets have the properties 
la; , a] = 0, la; ) a; = Te la;, a] (17) 
and 
d 
dl [a; ) a;] a 0, 


which make them useful devices in numerous manipulations of analytic 
dynamics. With their help the equations (14) become 





2a° Ok 
a a 
__al—eyak 1 (: - “y ar 
7 ke or e\ ka dw a) 
di 1 oR aR 
dt {ka(1 — e?)}} sinz | cos : dw 4 (20) 
- 1 ok (21) 


(ka(1 — &)} sind o¢ 


2\4 BS - afd 
ee ( = t | SE — conte 2 | (22) 


ka e| de 1—eé oO 


and a corresponding equation for 7 which will be discussed a little later. 
The five equations given here describe the changing geometry of the 
satellite orbit. All six equations together are known as Lagrange’s 
planetary or “variational” equations. 

In (18)-(22) we assumed that the perturbing forces were conserva- 
tive, ie., expressible as (dR/dx), (dR/dy), and (d@R/dz). In some 
situations, as for example in the case of drag, this is not so. Under these 
conditions it is convenient to represent the disturbing force by com- 
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ponents S, 7’, N which are in the radial direction, the direction of in- 
creasing true anomaly, and normal to the orbit plane, respectively. The 
derivation leading to (18)—(22) can be repeated with the appropriate 
modifications to yield a set of differential equations with S, T, N in the 
right-hand sides. For example 


3 
a 


a=2 las | [Sesnf+ 7 (1+ ecosf)], ete. (23) 


These are known as Gauss’s form of the planetary equations. We note 
that they contain the true anomaly as an independent variable. More 
will be said about this presently. 

It is possible to show that the planetary equations, especially in the 
last form, lend themselves to an alternative derivation which appeals 
to intuition. It is based on the idea that any continuously acting per- 
turbation may be interpreted as a sequence of infinitesimal impulses 
whose cumulative time response can be represented by a convolution 
integral. This approach leads to equations like (18)—(28) without the 
manipulations involving Lagrange brackets.® 

Inspection of (18)—-(22) shows that their nonlinear right-hand sides 
preclude an exact solution except for very special forms of R. Unfor- 
tunately, none of the perturbations encountered in nature fall into this 
category. One therefore resorts to a process of successive approxima- 
tions. 

Assuming that all disturbances represented by # are small in relation 
to the central attraction (1.e., 


aR k(a,y,2) 
(saa) [ABP = «<, 








as discussed in Section II) we consider the solution for the undisturbed 
motion, i.e. the Kepler problem, as a ‘‘zero-order”’ approximation to the 
actual case. Let its parameters be denoted a, c ---. If we insert 
them into the right-hand sides of (18)—(22), these equations reduce to 
quadratures yielding a first approximation to the effects of 2 on the 
orbit. These results are denoted a, e --- and known as “first-order” 
perturbations. We observe that (a /a\”), (e% /e), --- = O(«x). In prin- 
ciple, this process can be repeated indefinitely by substituting a” --- 
into the right-hand sides to integrate for a‘ --- . The limit is usually 
reached when the results for a, e --- have settled or human endurance 
is exhausted. (The latter constraint may be eventually eliminated by 
computer routines for symbol manipulations.) The convergence of this 
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process to the exact solution has been established by Poincaré and is 
of fundamental interest to the mathematician. Suffice it here to say 
that the ‘‘smallness” of perturbations discussed in Section II should be 
such as to justify the iterative process. 

When the right-hand sides of (18)—(22) are written out explicitly 
for any particular case, they tend to become awkward because a trans- 
cendental angle-time relation like (5) enters. Since the geometric de- 
scription of a perturbation R (or S,7',N) usually involves an angle like 
one of the anomalies very directly, it is convenient to use one of them 
as independent variable. The time relation which interconnects the 
anomalies for an osculating orbit (Kepler’s equation) can be stated in 
terms of the eccentric anomaly /, the true anomaly, f and the mean 
anomaly JZ as follows: 


Bf cits See a. affl-—e\, f]_ ed —e)sing 
Ee—esin FE = 2 tan (7 aE ‘) tan t| ~l+ecosf (24) 
(k/a’)*(t — r) = M. 





This may serve as a definition of H and M. The quantity (h/ ayi= 
is referred to as the ‘‘mean angular rate.” 
If we work in terms of the true anomaly, we can write the left-hand 
sides of Lagrange’s equations as 
da df 

i = —, — , etc. 

aie F di? ete 
where an expression must now be found for f. If we choose to consider f 
as the true anomaly in some osculating orbit valid at time ty, then it 
can be related to time by (24) in terms of the unperturbed elements 


a” = a, e” = e& ete. We call it an “unperturbed” anomaly and 
en it by f‘”. From (24) one finds 
k 3 
pO) =, Wee ee (0)\2 
f = a = a (1 + @ cos f ) (25) 


which transforms (18)—(22) to 


da” ay (1 = eo )* a 
df = —2 Ee (1 —- €9 COS ri 





as) ae — , ete. (26) 
0 


When integrated, these expressions represent first-order perturbations 
in terms of the unperturbed anomaly, i.e., a‘? (f°), ete. 

They suggest the following procedure for generating a first-order 
satellite ephemeris: if f) is the starting epoch we evaluate a” (f) ete. 
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between the limits fo” and f,"”. The time t; at the upper limit follows 
from (24) in terms of ao, e& ---. Using 4 and & = a + a(f”), 
etc., in (24), we find f, , the true anomaly for the new osculating orbit. 
Changing the notation from a to a, etc., and f; to f\, we can now 
repeat the procedure for the next integration interval. The ‘“‘updating”’ 
of orbit elements in the right-hand side of (26) amounts to a partial 
allowance for higher-order perturbations, while the recalculation of f 
at the beginning of each step represents essentially a first-order pertur- 
bation of the true anomaly. 

Instead of doing the latter by discrete increments, we can work with 
a “perturbed”? anomaly by differentiating (24) with proper allowance 
for the time dependence of a, e and 7. Using (25) one finds that, 


dp _f = 2(1 — &)'@? (34) a(l—e¢)? oR 


dfO ~ Ff = k(1 + e cos f)? \aa ek(1 + e cos f)? de 


a(1 — &) 
k?(1 + e cos f)? 


Oe Sa) ai (1 = ee sin Fens f] 


E — ¢) aR aaa (: - =) of 

asl ia some eee er lee Ce 

where (d#/da) means that £# is to be differentiated with respect to the 
semimajor axis wherever the latter appears explicitly but not when it 
is contained in n. This avoids the occurrence of a term with (t — 7). 
An expression analogous to (27) can be derived in terms of S,7',N; see 
for example Ref. 7, p. 4. Now, it is immaterial in a first-order approxi- 
mation such as (27) whether we consider f or f‘” as the independent 


variable in the right-hand side. Let us assume the former and use the 
symbol df/df = »(f). Then (26) becomes 


da” — —2[ a’ (1 — &) ] -2 af 
aS Es (1 + e cos f) , ete. (28) 


-+- ja + cos f) tan} 


(27) 





The integration procedure now runs between consecutive limits fy, 
fi, -°: f;, with updating being required only in the orbit elements. 
The corresponding epochs t; are of course computable by substituting 
a;,e; °-: f; into (24). The relative advantages of integrating the per- 
turbative equations in terms of f“ or f depend on the problem at hand. 
As we shall see later, the choice between an unperturbed or perturbed 
independent variable is available in most perturbation methods. 

Up to this point we have restricted our discussion to the first five 
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orbit parameters, which describe the geometry of the osculating orbit. 
Wherever 7 appeared, as in (24), we assumed that it would be avail- 
able from a suitable sixth equation. This parameter is needed to corre- 
late the independent variable, such as f, with time. 
An equation for 7, corresponding to (18)—(22), can be obtained by 
the process outlined before, which yields 
2a aR , a(1 — e) oR 
oe ee 2 
? k oda T ke de * (29) 
Sometimes it is convenient to work with the slightly different parameter 
x = —nr. The differential equation for it reads 


i ,oR 1-—¢é ar 
~ e(ak)t de” 





= —2(a/k)* —— (30) 
[A superficial comparison of these two equations gives the startling 
impression that (30) is obtained by multiplying (29) with —n, thus 
neglecting the 7 term that should appear. This term is really absorbed 
in the difference between (dR/da), and (dR/da),, as implied by the 
tio equations; i.e. partial derivatives of & with respect to the semimajor 
axis, holding 7 or x constant as required.| 

One could transform (29) and (30) to f (or #) as independent 
variable and pursue the quadrature as we did before. However, since 
dR/da involves df/da (or dE /da), we notice from (24) that this in- 
troduces the factors tan’ {[(1 — ¢/(1 + ©] tan (f/2)} (or EZ) 
into the integrands for r (or x‘). They can be quite awkward. 

Several devices have been developed to circumvent this difficulty. 
According to one approach we transform (29) from 7 to WJ = n(t — 7). 
The necessary compensating factors arise thereby which eliminate all 
aperiodic terms. Transforming the integrated equation back to 7, we 








have 
3 4 2y38 72 phi 
TiN = Too -+- (wy = nN) + joa [ 
k Jf (0) 
3k} 1 — ef ak 
. p)\—2 } =) LE en 
(1 + e cos f°’) 2a, + aia bes (31) 
or 
+ 2(ao/k)? & af af” 
where 
ny = (k/a,)', no= (k/a0')', 71 = To+ 7", & = A+ a; 


and (0R/éay) has the previously established meaning. Equation (31) can 
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be obtained in terms of the perturbed anomaly f if it is understood that 
a; in the quadrature and in @ is obtained by (28) and if the integrand 
of (31) is multiplied by 1/y». 


3.1 Oblateness Effects 


We briefly illustrate some results from Lagrange’s method. In the 
well-known example of oblateness perturbations, the first-order solu- 
tions for a, e, and z turn out to be entirely periodic and not very in- 
teresting.» The remaining elements, however, exhibit secular terms. 
Using only the J-term of (9) we find from (21), (22) and (31) 


x JR 


0, = az — es)t (cos %) (f1 — fo) + periodic terms (32) 


2 
G1 = wo — (cos %) (MQ — %) + ie (1 — Pat is) 
ao 2 


pm 2\2 
oni (1 eq”) (33) 
eet aa age(1 — of)? X p.t. 
a day eT UO can EEN 
Be. Stig ee) 
X {t(1 + e cos f)’ [1 — 8 sin’ wy sin” (w. + f)]}}) 
(34) 


+ 5 (1 — eo)? [(a1 — wo) + (cos %) (Qi — Q%)] 


SR S 65 
~ fixao (1 — eo)? (1 ee is) (fi — fo) + pat. 


where f, fi, fo represent the unperturbed anomaly. (We have omitted 
the superscript zero for convenience.) Equation (32) confirms the well- 
known secular behavior of the node. It turns out to shift westward for 
0 < % < 7/2 and eastward for 7/2 < % < mw. At % = 2/2 it remains 
stationary, as would be expected from symmetry. 

The secular component of w”, according to (33), reduces to the well- 
known term 


JR’ (5 cos’ ty) — 1) 
2a,?(1 = eo")? ; 
It represents an advance of perigee for 0 S 7 < 63°26’ and for 116°34’ < 
io S x. For 63°26’ < ip < 116°34’ perigee regresses, and at the “critical”’ 
angles 63°26’ and 116°34’ it is reduced to periodic motions (as far as 
the first-order analysis indicates). We note that the periodic terms in 
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(33) contain eo in the denominator, and we expect w” to behave un- 
stably for near-circular orbits (as one might expect for geometric 
reasons). Indeed, this singular behavior can be expected also in other 
examples, according to (19), (22) and (31). Furthermore, some diffi- 
culties will arise with small values of 7, according to (20) and (21). 
These cases of near-circular and near-equatorial orbits can be accom- 
modated by redefining the orbit elements in various ways. While such 
modified elements are less accessible to a geometric interpretation, they 
do not encumber the calculation of perturbed satellite positions as a 
function of time. For the sake of brevity we must forego additional 
details here. 


3.2 Lunt-Solar Gravitation 


We omit a discussion of &, since it shows periodic perturbations 
only. Substitution of (8) into (19)—(22) and (31) yields 


ae _ 15 mpdo eohrhe _ ay} alt = a ‘| Z 
€1 = 0 mre (1 €o ) {tan 1 + @ tan 5) Hs (35) 








+ p.t. 
where 
hy = Lit, + myp + NZ 
hy = lve + Me2Y p os Ney 
3 
a ae en ee (EY ry ee Gee 
Mel p 4/ 1 — ef 
1 3 f fi (36) 
-1 — €0 
(i *[(T eg) tong ff + me 
O, = Oy + BMevaoll(L + eo") (aa')s + CL = 601) (hea!) 
: : 2mPp> Sin to(1 — eo7)? (37) 





mW eee a f ft 
os (52) maf + 
@1 = wo + cos 49 (Qo = Q,) 


3mpAo” ee ee ee 
Tune (h— e0')'lAha’ — ha? — ry (38) 


a (ear? 3 f fi 
ops) off 


+ 
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( 


1 — e°)? : 
= eee [1 — wo + (% — Q) Cos to] 
1 


7 =t%>t+ 


ny 


| = No 3Gm,(1 = eo)’ (th = to) 
ee (1 =) + “ann F(1 + es 008 fo)? 


1 


ee a Gcomye pes (39) 
Qnehar p® 
‘ 2 2 
[2 + 30° — ans (1 + 4eo°) — ge (1 — ed) | 
lp aes 





Sire L—e\, fy” 
{tan  (; 7 *) tan as + p.t. 


The subscripts 7 in (86) and (87) denote partial differentiation with 
respect to the inclination. The secular term of e™ is a significant feature 
of our present results. It indicates that even near-circular satellite orbits 
can experience an unstable buildup of eccentricity due to luni-solar 
perturbations. The rate of this perturbation is proportional to the factor 
Myo /Mep® and is usually very small; moreover, the coordinates 2, , 
Yp, &p of the perturbing body are really time-dependent, which would 
modify the first-order result. Nevertheless, a long-period change of the 
eccentricity due to luni-solar gravitation has been observed in some 
satellite orbits. 

The explicit form of the periodic terms in a and e™ contains the 
factor 1/eo , while Q, w” and 7” contain 1/(sin 7). Again this necessi- 
tates the use of specially modified elements for low eo and 7. One set of 
elements which is particularly suited to the problem of interplanetary 
perturbations is due to Strémgren. He utilized the fact the Q, 7, w are 
nothing but a set of Euler angles orienting a system of orbital coordi- 
nates with one axis through pericenter, one at f = 7/2, and one normal 
to the orbit. The rotation of these axes with respect to inertial space 
conveys the same information as the perturbations of 2, 2, w. The idea 
is akin to Roberson’s method for anticipating secular terms in the 
perturbation of coordinates (Section 4.3 and Ref. 15). 


(1) 


3.3 Higher-Order Analyses 


The preceding examples are indicative of results to be found in the 
vast literature on perturbations in the osculating elements. We have 
merely covered the gist of this approach and several ideas which will 
be useful in the appraisal of other methods. Some of the better-known 
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contributions in terms of osculating elements are contained in papers 
by Krause, O’Keefe, Kozai, and Iszak. 

In principle the quadratures (18)—(22) and (31) could be evaluated 
iteratively to generate higher-order results. This procedure rests directly 
on Poincaré’s convergence proof, and a formal technique based on this 
approach is commonly attributed to Poisson. In several aerospace 
publications this has been done to obtain second- and third-order secular 
terms for oblateness effects. The algebraic labor is considerable, though 
typical secular terms such as (40) and (41) tend to be reasonably com- 
pact (see Refs. 7 and 9): 


@) _ On J'Reo _ (1 — 3 sin’ Zp sin’ 0){1 + ey sin (w) + 4%) 
2a3(1 = €,7)5 ‘ : ; ; : (40) 
X (5 sin’ 4) — 4) cos (wo + 4) 


Ag = Ord R' sin 2 
aa Aao*(1 =e eo" )4 


Aa 


{seo sin 2w) — 1(5 sin’ % — 4) 


: 41 
[2e0" sin 2wy — ey COS wo (Cos % -+ 4 Cos 3) (41) 


— @ SiN w (sin % — 4 sin 36)]}. 


These results represent secular increments over a 27 step in 0, the cen- 
tral angle measured from the node, where 6 is the initial value of 06. 
Considerable emphasis must be placed in the derivation of such expres- 
sions on checks from the conservation of energy and angular momentum 
and duplicate execution of the algebra. (One likes to think that more 
elaborate explicit expressions will be attainable with the advent of com- 
puter algebra.) A notable contribution in this area was made by Merson,° 
who presents second-order secular terms for J and first-order secular 
terms for the next four higher harmonics of the earth’s potential. He 
also advocates the use of “‘smoothed”’ elements which reduce the ampli- 
tude of first-order periodic terms that might otherwise be inimical to 
prediction accuracy. 


IV. PERTURBATIONS IN THI COORDINATES 


We turn now to a description of satellite motions directly in terms 
of the position and velocity vectors. While these are dynamically equiv- 
alent to the instantaneous orbit elements, we note that the time de- 
pendence of the dynamic state variables in this form reflects the anom- 
alistic motion as a primary effect. Therefore the long-time, secular 
changes of the orbit may not be obtainable with the same clarity or pre- 
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cision as in terms of the osculating parameters. On the other hand, the 
position-time history gives a direct account of the satellite motion in 
space and is useful for many aerospace applications. This prospect has 
stimulated several engineering analyses in recent years. 

To the analyst with a general background in mechanics it would seem 
quite natural to approach a system of equations of the type (14) by 
standard perturbation techniques. Thus one could assume the pertur- 
bation series 


a(t) = 0 (#) Hexv P(t) + Pe (4) +e 
y(t) = f(t) y(t) + y(t) + °° (42) 
a(t) = 2° (t) Heed) + P22 (4) +e 


and determine successively higher-order terms from the appropriate 
governing equations, following essentially Poisson’s procedure. The 
traditional Encke method pursues this line of attack. However, Cartesian 
inertial coordinates have not enjoyed as much popularity as spherical 
ones, which seem more compliant with the geometry of satellite orbits. 
In the following, therefore, we shall concentrate on reference frames of 
this general type. 


4.1 Perturbations in Equatorial Spherical Coordinates 


A rather well-known perturbation analysis for oblateness effects is 
that due to Blitzer, Weissfeld, and Wheelon.'® It uses the conventional 
equatorial spherical coordinates, r, 9, y (see Fig. 2) in terms of which 
the equations of motion read: 


_k _ 30kR 





i—re—recos eo = aie a [k — sin’y] (48) 
pe 
: (9) +7’ sing cose = — aun sin ~ COS y (44) 
d 2 2 
ai (r cos gy) = 0. (45) 


Here we have considered the first aspherical term in the earth’s poten- 
tial only. Since this is a zonal harmonic and does not contain y, the last 
equation has a vanishing right-hand side. Then a first integral of this 
equation 


r cos ey = p = const., (46) 
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Fig. 2 — Equatorial spherical coordinate system. 


representing the conservation of angular momentum about the polar 
axis, permits a change to y as independent variable. In addition, it is 
convenient to introduce the definitions 


rea, tang = JS, and [1/(r cos g)] = W. (47) 


Equations (48) and (44) now become 


9 2 2 
w’ Ae We k w+ 3JkRu | ] _ | (48) 


Plt)! pit Slit 3 
and 
” -_ QTR? Su 
FSF Tt ae 


where ‘‘primes” denote differentiations with respect to y. In this form 
they are readily accessible to a perturbative procedure. We let 


Sa.8% 4p as 


n=1 


w= ur t Tu” (50) 
sl 


WH 0? eS Sw. 
n=1 


Now we note that y in (46) was understood to represent the actual, 
i.e. perturbed, longitude of the satellite at all times. In order to make 
the connection between this perturbed independent variable and the 
time we find that 
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1 ¥ dy 1 v 1 wr 
mom 5 LG” 5 A, woe(! ~ 2pm td a 


~ p ate » Ti 


n=l 


(51) 


i.e. the time itself evolves as a perturbation series. In the zero-order 
solution of (48) to (51) the right-hand side of (49) vanishes and we get 


S® = Asin (y + 8), (52) 


where A and 6 are integration constants. On the right-hand side of 
(48) we retain the first term after substitution of S°. Then W™ follows 
in a straightforward manner. Substituting it into (51), the usual trans- 
cendental expression for time in Keplerian orbits results. It is somewhat 
obscured by the fact that its argument is given in terms of the longitude 
rather than one of the anomalies. We record the simplified form of these 
results for circular orbits, where uw” = 1/ro: 


Ww = ~ 1 + A’ sin? (p + 5)]! (53) 


@) _ To 2 1 2y4 1 
eS op a tan [(1 + A‘)? tan (Y + aN (54) 
It is important to note that the expressions (52) and (53) represent 
Keplerian (in fact circular) motion only for the unperturbed case: i.e., 
if (54) represents the entire time equation and no higher-order terms 
as in (51) exist. For any perturbed motion, where y is perturbed in 
relation to time, the zero-order terms of course retain the Keplerian 
forms (52) to (54), but they do not actually represent Keplerian motion. 

If we now proceed to the first-order solution and retain only terms of 
0(.J) in (48) and (49), we find 


_ KR? Sy 





gor 4 g@ : 55 
+ p? [1 + gs |? ( ) 
and 
wor “ti Ww” os ores” 
pil + SO 
(56) 


3hReue | es | 
PUTS PEL se 3 


Since the right-hand side of (55) contains only zero-order quantities, 
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we begin our solution there and the result is 


g® kR°A sin (y + 6) 4. 008 (yp + 6) 


pr, A{l + A? sin? (y + 6)] 1+ A? 





1 Fa tan '[\/1 + A? tan (py + 6)] (57) 


_ sin 2(y + 8) 
1+ A’ sin? (y + 6))° 


Here we do not show a complementary solution, since it is of the same 
form as 8” and can be absorbed with the constants A and 6. We could 
now substitute (57) into (56) to find WW and then use (51) to caleu- 
late the time. However, the inverse tangent in S” constitutes a secular 
term, which is considered an objectionable feature for some applica- 
tions. 

This disadvantage can be avoided by inserting an additional trans- 
formation between y and the argument of S°”. Instead of using y + 4 
for the latter let it be 


o=W~t+s (58) 


where 
healt > I, (59) 


and the A, are constants. This device is commonly attributed to Lind- 
stedt.’” To obtain the zero-order solution we need only substitute o for 
the angular arguments in (52) and (53). However the equation for S” 
changes significantly, viz.: 


—2kR* Sy 


where the primes now denote differentiations with respect to o. Thus we 
find 


gor Ee g@ -_ _ DSO” (60) 


kR’(A? cos’ « — 1 — A’) 
pArn(1 + A*)(1 + A? sin? o) 


Penn ic ae se Aa I 
pro(l + ADI an ano]>. 


S? =, A sine — 
f (61) 
— cos meee — 


The appearance of the free parameter ); in this result gives us the 
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opportunity to suppress the secular term. Thus, if we choose 


kR? 


M = Proll + Ay 


(62) 


(61) becomes 


kR2A cos o ; 
prot + A: (tan 


Here we have again omitted all terms of the same form as S“”. The net 
contribution from the terms in braces is cyclic and has the period 27 in o. 
According to (58) and (62) this amounts to a period of 


JkR? 
Qa E — rp + Ay + | (64) 


in w. In effect, Lindstedt’s transformation distorts the independent 
variable to absorb the secular effect. We shall see more of this later. 

In principle we could transform (56) to o and solve for W in a straight- 
forward manner. However, to simplify the algebra, a redefinition of W 
will be convenient. We may backtrack to the explicit form of (48) in 
terms of S, u, and o. 


a a Sorry at oft ite PA [Sie Sheu ne 
i. [gor el go a Wu“? 


_ 3kR? (0)2 1 2 (65) 
= PP U Lee 8 


_ ggg ae gorgmr a2 SOP ru 


soe (1 + A’)? tan 6] — o}. (63) 





and take 


WwW? = (1+ 8° )u®. (66) 
Then we obtain” 
Ru [1 — 28") 


(1) GQ) _ 
wor 4 Ww? = ai 


2u” 2 on) 
_ Ai [ss She gorga@: ale go yal 
where A = (1 + A’ sin’ a)’. 
Substituting (63) and ignoring the complementary solution for W“ we 


* Note that the formulas (86) and (88) in Ref. 9 contain several misprints. 
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& —kR? 
3pre(1 -b A?)3A8 


+ A°(9A‘* + 2A” — 7) sin? o + 5A* + 2A? — 3). 


[44‘(A* — 1) sin‘ o (68) 


Now it only remains to find a relation between o and time. From (46) 
it is clear that 


= ae. 2 9 -if do 
t b= 5 as cos Ca LE SD? (69) 


which we expand up to 0(./). This results in 


2 


{ = ai ayn 1a A’)? tan o] (70) 
and 
1 -" Pps ahs 2 ye af 2A = ad i 
=~ Ew, ae [A? sin 26 + 12(1 + A?)%o] 


(71) 


2 
+ |2 — 347+ - ((1 + A’) sin? « — cos’ “| 


-tan [(1 + A’)? tan a} 


and completes this analysis of near-circular orbits. 

Throughout the foregoing discussion we have used a perturbed 
coordinate, Y or o, as the independent variable. In principle, we could 
have done without Lindstedt’s device, and we could have used the un- 
perturbed longitude y‘” as the independent variable. This approach 
has the attractive feature that the zero-order solution (in terms of y°”) 
represents true Keplerian motion. The perturbed longitude could be 
expressed in terms of y as 


p= v4 Dyer). (72) 


However, if one develops the governing differential equations for S“ 
and W”, he discovers that they are completely coupled for this par- 
ticular example. This approach, therefore, loses its practical value. 

In conclusion we note that, in spite of various transformations, the 
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final results (63), (68), (71) of this analysis seem rather awkward, 
considering the fact that they represent the relatively trivial first-order 
oblateness perturbations of a near-circular orbit. This is of course due 
to the choice of spherical equatorial coordinates to represent the motion 
in a nonequatorial orbit. That disadvantage was eliminated in other 
formulations, to be considered next. 


4,2 Perturbations in Orbital Spherical Coordinates 


As the title of this section indicates, it is more natural to use the 
plane of unperturbed motion as the fundamental plane for inclined 
orbits. A typical analysis in this category is that by Anthony, Fosdick 
et al.’*"* The coordinates r, 6, 8 of their reference frame (sce Fig. 3) 
take the place of r, y, (7/2) — ¢ in Fig. 2. Theanglea = (2/2) — 
(Vig. 3) is introduced occasionally for trigonometric simplifications. 

The left-hand sides of the equations of motion of course are not al- 
tered by this change of coordinates, but the right-hand sides (represent- 
ing oblateness perturbations) acquire the forms shown in (73) to (75). 
As usual, we introduce u = 1/r and p = 7°6 = 6/u’ and change the 
independent variable from ¢ to 6. We note that @ is the perturbed cen- 
tral angle in the nominal orbit plane. Thus, the equations of motion 
become 


(pu’) + pu(B” + sin’ B) = (k/p)fl + JRuw(1 — 3 cos’ @)] (73) 
(—kJR?u/p)[(sin’ 7 sin” 6 
— cos’ 7) sin 28 (74) 
+ sin 22 cos 26 sin 6] 
(—kJR’u/p) sin’ 7 sin’ 8 sin 20 


+ $ sin 27 sin 28 cos 6] 


I 


(p0’)' — psin B cos B 


(p sin’ B)’ (75) 


where primes denote derivatives with respect to 6. We subject this 
variable to the first-order Lindstedt transformation 


6(1 + Jd1) (76) 


0 

and use the ‘‘ansatz”’ 
w= wc) + Ju? (oe) 

p(c) + Jp («) (77) 

(1/2) + JB (c). 


® 
I 
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Fig. 3 — Orbital spherical coordinate system. 


Without an excessive loss of generality we may assume a horizontal 
launch at the node and, by the definition of the @ plane as nominal 
orbit plane, the initial velocity vector vp lies in it. Thus, at ¢ = O:r = 
ro,? = 0, B = 7/2, 8 = 0, and 6 = w/ro. In general, v will be such as 
to produce an elliptic orbit. The zero-order results are 

p = Povo 
and 


u” = (k/ro’ve)[1 + e cos o] (78) 


where 


e = (rwo/k) — 1 


and has the form of a Keplerian eccentricity. As in Section 4.1, the 
zero-order solution (78) will represent Keplerian motion only for the 
unperturbed case, i.e. when ¢ = 6 = 0°. Now the first-order solutions 
follow in a straightforward manner: 


p? = (—k/2r’) (R/r)’ sin’ i {1 + 4e — e cosa — 4e cos 30 
— cos 2c} 7 
UP = (WR'/ro've') {1 + ge + sin’ i(— 3+ ge — $e’) 
— 1(e + sin’ 7) cos 26 — xe sin’ 7 cos 3o (80) 


2 + 2. 
— ze sin’ 7 cos 4c 


— [lL + 4¢ — sin” i(3 — ge + 3e°)] cos o}, 
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where we had to select the Lindstedt parameter as 

= (KR? /ro'vo*) (2 sin’ ¢ — 1) (81) 
to avoid a secular term in (80). Finally 


KR? sin 27 


ae [((1 + 2e) sino — o coso — de sin Qc]. (82) 
0 40 


(1) 
B = 


It is clear that the secular term which was absorbed by the Lindstedt 
parameter has to do with the apsidal precession. In the absence of addi- 
tional Lindstedt parameters we have no countermeasures against the 
secular term in (82), which reflects the nodal regression. (Note that the 
latter was counteracted by the Lindstedt transformation of Section 4.1, 
since it was the only secular effect to be considered for near-circular 
orbits.) 

The time equation for this example can be written in a straightforward 
fashion. Irom the definition of p it follows that 


(= JM) d 


h—tb = 
1 0 - me 


where op) and o; correspond to the time limits tf) and ¢;. An expansion 
to first-order terms yields 











1 8 =6oda 
oo Dp U 
(1) (1) (83) 
-3f's mi, AOE Jat? + — 5 tT 2 “| do. 
This leads to 
3. 3 . 
4 = To Vo —é sino 
ke? \(1 — &)(1 + e€ cosa) 
(84) 


2 tan aan o 71 
Weigle Nee) Oe 


which we recognize as being of strictly Keplerian form but in terms of 
the perturbed angle oc. Similarly 
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a R? {sn a(4 — e + 3¢ cos a) 


~~ ro) | (1 — €)2(1 + e cos )? 


1 + 2¢ + de? + 26’ 


+ @ —~athetae- x’) sin” i| 
_ sin 
(1 — e?)(1 + e cosa) 


+ é — $+ We - se) sin” i| 


[201 +e)? + fe’ sin’ 7], 4 |(; — ° | e 
= Ga 2! tan aes tan 5 as 


The set of results (78) to (85) gives a reasonably convenient descrip- 
tion of first-order oblateness perturbations which might be useful in the 
targeting and guidance of space vehicles. Extensions to near-parabolic 
and hyperbolic trajectories follow quite readily. As in Section 4.1, we 
note that the analysis might have been executed in terms of an unper- 
turbed independent variable, viz. 6‘° instead of 0, and in that case the 
zero-order solution would represent true Keplerian motion. 

The inclusion of secular perturbations in the independent variable o 
serves the same purpose as the definitions of “mean elements” intro- 
duced by Breakwell et al., by Hansen, in the von Zeipel method, and in 
modern averaging techniques. The Lindstedt transformation is not the 
most powerful device in this category but it can be extended to absorb 
secular effects in more than one coordinate. This will be illustrated in 
the next section in terms of “secular rotations” of the reference frame. 





2 — ke + 3c (85) 











4.3 Perturbations in Rotating Spherical Orbital Coordinates 


The idea of using suitable coordinate transformations with arbitrary 
parameters to neutralize secular trends was exploited in a more general 
way by R. E. Roberson.” His approach uses the orbital coordinates 7, 
6,6 [= (/2) — 6], in agreement with Section 4.2, but assumes that 
the entire reference frame will be subjected to three monotonic rota- 
tions, corresponding to three Euler angles, such that the satellite motion 
relative to this reference frame exhibits only periodic perturbations. 
This kinematic outlook on secular trends forms an interesting parallel 
to several classical procedures. Roberson himself makes some illuminat- 
ing comparisons between engineering analyses such as Refs. 9, 10, and 
12 to 16 and traditional formulations in terms of mean variables. He 
restricts his analysis to first-order perturbations, realizing that a con- 
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sistent higher-order theory would have to include contributions from 
other physical effects and various coupling terms. Some of his remarks 
seem quite perspicacious in comparison with the other aerospace litera- 
ture of that time. 

The angular velocities stipulated for the reference frame must of 
course depend on the secular effects that need to be absorbed. In the 
presence of several physical disturbances the different angular motions 
of the coordinate system can be superposed to first order, and the re- 
sultant motion of the reference frame will succeed in neutralizing all the 
secular effects simultaneously. This seems intuitively obvious and can be 
demonstrated in a straightforward fashion.” 

In Fig. 4 the angles Q and 7 define a mean orbit plane, in that each of 
them manifests a secular rate. Now the satellite position is given in 
terms of the orthogonal system %, 7, 2, which displays a secular variation 
with respect to the node (and this corresponds to the third Eulerian 
rotation). Let the three secular rotations be denoted «2, «(di /dt), 
xa” where x is the perturbation parameter. They will in general be 
functions of Q, 7 and the characteristics of the perturbation source. 

Turning to the problem of first-order oblateness perturbations, we set 
x = J and assume the appropriate form for the perturbing potential. 
Now let 


6= H+ f= +f" + Jf”, (86) 
where f = 0 at ¢ = 0. We adopt f as independent variable and let de- 





Fia. 4 — Rotating spherical orbital coordinates. 
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rivatives with respect to it be denoted by primes. Assuming that the 
secular rates are constants, we let 

&= 9 4 QF 
= { Be Ji F (87) 
6+ Jaf. 


2 


DI 
| 


As usual, the equations of motion are transformed by means of 


u=1/r and p= rf (88) 
and we find: 
(pu’)’ + pul[(s’ — & cos 6 sin 2)” 
+ (cos 6 + &” cos d + 2” cos ¢ cos v)”| (89) 
— (k/p)[l + 3JR’wW(1 — 3 sin’ g)] = 0 


[p(6’ — 2” sin ¢ cos 6)’ + p[(1 + 6)’ sin 6 cos 6 
+ (1 + '”)Q (sin 6 cos » cos ¢ + cos 6 sin ¢) (90) 


+ G* sin ¢ cos » cos gy] + (JkR*6u/p) sin ¢ cos ¢ cos vy = 0 
[p cos 6 (8’ cos 6 + 2" cos @ cos v)]! 
+ p[(1 + 4°’)& sin 5 cos 6 sin 7 cos 6 
+ §® cos 6 sin 7 sin 5 cos » cos ¢ (91) 
— 6/0 sin 7 sin 6] 
+ (6JkR*u/p) sin ¢ cos 6 sin 7 cos 6 = 0, 


where ¢ is the latitude and » is defined in Fig. 5. We have made a slight 
digression from orderly progress in this step by setting 7” = 0. This is 
prompted by previous experience with this problem — viz., that no 
first-order secular perturbations occur in 7 — and would have developed 
from the later calculations in any event. 

Using the forms 


w= Uo + Sy 
p= p” ab Jp” (92) 
= Jo 
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Zz 





Fig. 5 — Definition of ». 


we reduce (89)—(91) to equations of 0(1) and 0(J). The zero-order 
solution is of course 


0 
5 0, p” = const., 


93 
and ul = (k/p”)[1 + e cos (f — a). 098) 


As in previous examples, we see that this will represent Keplerian 
motion only in the absence of perturbations, ie. if f = f° and & = 
&” = 0, yielding an inertial reference frame. We assume that the con- 
stants of integration (p, e, a) are chosen such that (93) with f = 0 
yields the satellite position and velocity at t = 0. 

Proceeding with the solutions to 0(J) in the usual fashion, we re- 
quire that 


y 37 k?R? cos 4 
OS = =o (94) 


in order to avoid a secular term in 6” and 


Ae = (3R°K?/2p*) (5 cos’ i — 1) (95) 


to avoid one in u’. These of course reflect the nodal and apsidal pre- 


cessions. The complementary solution for p introduces one constant 
of integration, and the complementary solutions for 6° and u“ have 
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the form 
A sinf + B cosf. (96) 


Since the zero-order solution already accounts for the dynamic state of 
the satellite at ¢ = 0, the first-order solution encounters homogeneous 
initial conditions as far as they do not reflect the rotation of the reference 
frame. Thus at f = 0: 

, uy = 5 = 0, yr’ — () 


A(1) os «(0 
6! — G® gin 7 eos 0 = 


| 
= 


and 


(A/F) (20 op a yon) oe ge 


: (97) 
+ 8 (eos 7 — sine sin ) = 0. 


These govern the first-order constants of integration. [A little reflection 
shows that the forms (96) for 6° and u can be interpreted geometri- 
cally as constant changes of 2 and 6 to 0(J/). Roberson anticipates this 
by introducing such constants in (87) and using them in place of two 
of the integration constants for 6° and u“. However, nothing seems to 
be gained by this artifice and, if anything, it distracts from a systematic 
procedure. | 

Finally, the time equation follows as usual in terms of f to O(J). 
Roberson proceeds to invert it, though the computational gains do not 
seem to justify this algebraic labor. 

So much for our sketch of Roberson’s procedure. Its extension to 
higher-order analyses is fairly obvious. At every level of refinement, 
0( J”), three coordinate rotations may be introduced — which are com- 
mensurate with the three degrees of freedom of the satellite problem 
whose secular trends we are trying to neutralize. 

For ‘“medium-range’”’ prediction formulas it seems an open issue 
whether the rationale described in this section and traditional astro- 
nomical devices (like the ‘‘auxiliary ellipse’ used by Hansen or the von 
Zeipel transformations based on Hamilton-Jacobi techniques) offer a 
computational advantage over the straightforward development of the 
Poisson method for successive higher-order terms. With the advent of 
computer algebra the latter technique may be quite satisfactory for 
many applications. However, for “long-range” predictions and life- 
time studies it seems advisable to employ the accredited astronomical 
techniques of “extracting” secular effects and anticipating long-period 
terms in one way or another. 
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V. MOVING RECTANGULAR ORBITAL COORDINATES 


We close this article with a formulation which calculates the posi- 
tion offsets for a satellite from its unperturbed orbit in an explicit 
form.”*"”” Instead of reckoning the perturbations in terms of the quan- 
tities r, 6, 8 or 6, which are defined relative to the center of the 
earth, we now consider a coordinate system whose origin is the nom- 
inal satellite position 0’ on the unperturbed orbit (see Tig. 6). The 
latter may be defined by the initial conditions at time ft), viz. ro 
and vy. We establish an orthogonal triad about the moving point 0’ 
with € in the radial direction, 7 in the direction of anomalistic motion, 
and ¢ normal to the orbit plane. In the guidance engineer’s language 





Fig. 6 — Moving rectangular coordinates centered at nominal satellite posi- 
tion. 


these represent offsets in altitude, range, and cross-range. Any non- 
vanishing coordinates in this system are the effects of errors in the 
initial conditions or of geophysical forces. It is clear that this description 
of the perturbed motion can be quite useful in guidance studies, e.g. 
to exhibit the relative motion between a space station (given by 0’) 
and a transfer vehicle (located at £, 7, ¢) in a homing maneuver. In the 
subsequent discussion f will always represent the unperturbed true 
anomaly in the nominal orbit and @ = w + f the unperturbed central 
angle.” No Lindstedt transformations or perturbative coordinate rota- 
tions will be employed to develop this theory into a more sophisticated 
prediction scheme. Instead, we concentrate on the geometric interpre- 
tation of various results. 


* We depart from earlier notations by omitting the superscript (0) from un- 
perturbed quantities for simplicity. 
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The equations of motion can be derived by the standard Lagrangian 
or Hamiltonian formalism,” and their linearization to 0(£, 7, ¢) yields 


g" — Qn! — & — 2[E + e(f’ — 9) sin f]/(1 + e cos f) 


=- oe Fit ecossf 
9” + 2@ — 9 — [— 9 + 2e(y’ + &) sin f]/(1 + ¢ cos f) 
tt) Castaic OP 
gc’ + [g — 2¢’e sin f]/(1 + ¢ cos f) 
a “ e)° Fe/(1 + ¢ cos fy" (100) 


where primes denote derivatives with respect to f and V is the perturba- 
tive potential, which exists in addition to the central body term —k/r. 
The subscripts of V denote partial derivatives with respect to é, n, or ¢. 

The solution of the homogeneous set (98) and (99), where V = 0, 
represents a complementary solution for the cases where V # 0 and 
will be needed to satisfy the initial conditions. For an elliptic nominal 


orbit this solution has the form 


l1-¢ 3e k ; . 
=e ial 5 sre” 3 (r=) ee) sn] 


; (101) 
— deacosf — sin 
i sro(q—ans) sind 
ne k 
n= 3 (7 a aa) (1 + e cos f)(t — 7) 
. ~(2+ ecosf) ( k ) 
-+- 6e asin f T+ecosf. or a — &)a (102) 
a(1 — e’) 
c= eee [si sin 6 — 6Q sin 7 cos 6] (103) 
(1 + e cosf) 
This result can be adapted to hyperbolic, parabolic, and near-parabolic 
orbits without much trouble. The constants of integration 6a --- 67 are 


of course just a set of numbers to be determined from the initial condi- 
tions, but the symbols we use for them indicate the parameter changes 
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of the nominal orbit that they represent. Alternatively, these constants 
could be given in terms of & --- 0’, the perturbations of the position 
and velocity at to). That form is more descriptive for various guidance 
applications. Thus we find for nominally circular orbits 


E =2no' + 4f0 — (2n0' + 3&0) cos (f — fo) + fo’ sin (f — fo) (104) 
n= no — 28! — 3(m0 + 260)(f — fo) 

+ 2(2n0 + 3&) sin(f — fo) + 2&’ cos (f — fo) 
go’ sin (f — fo) + $0 cos (f — fo). (106) 


(Since a nominal perigee docs not exist for this case, we assume that 
the angle w, whose existence is still implied by the notation f, has some 
arbitrary value @. Without loss of generality we can set fo = 0 so that 
Ao = a.) 

In a guidance application £ --- {o’ might represent the errors result- 
ing from a position and velocity determination or a corrective thrust 
maneuver and (104) to (106) would then describe the “run-out” as a 
function of time. In an “orbit sensitivity” study these expressions can 
be used to demonstrate the effect of & --- €’ on the orbit parameters. 
In a homing maneuver the same expressions would represent the rela- 
tive motion between the two vehicles attempting a rendezvous. In 
principle, two relative position measurements X: & 7, ¢ at separate 
times suffice to determine all the constants in (104) to (106), and a 
corrective maneuver could be planned to drive the residuals to zero at a 
specified instant or by successive approximations. 

Particular solutions of (98) to (100) can be found in a straightforward 
manner if e = 0. For e ¥ 0 we construct these solutions as power series 
in e, for lack of a better expedient. We consider the series to O(e) and let 
them be denoted by 


E=ht+h+e Di; 


(105) 


is 


n=titmte Digi j= 1,2,3 (107) 
Jj 


“~ 
I 


HBtht+e Dh; 
where 


Eg = the complementary solution (104) to (106) 

Eopote = a particular solution representing V to 0(«) 

figity = solution reflecting e-(&, 71, &) on the right-hand sides of 
(98) to (100) 


I 


I 
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fogzh. = solution reflecting e-(& , 72, €) on the right-hand sides 
fsgshs = solution reflecting e-(V:, V,, V¢) on the right-hand sides. 
The following explicit general forms can be given for these solutions: 


bib | -2 7 V,af +2 cos f | ie coat a 
as Coes / Vesinyvare 2 aut / V guy ay 
~sinf [ 7; cos df 
mae E I V, df df +2 i: Vere asin [ V, cos fdf (108) 
= asin f [ Vera 4 cos f [ Vesta 
9 cosf | V cos dt 


fe = (a°/k) [cosy V;sin f df — sin f [ V; cos. df 
where we take fo = 0 for the lower limit of all quadratures. 
fi = (m0 — 2&0") sin f — 3(m’ + 2&) cos f — 3(m' + 2&0) 
-f sin f 
g. = T(o’ + 28) sin f + (m0 — 260’) cos f — 3(m" + 2) (109) 
‘feos f — (£0'/2) cos 2f — (mo" + 2&0) sin 2f 
hy = — (60/2) — (F0'/2) sin 2f — (0/2) cos 2f. 


The terms fe, go, and hy are obtainable from expressions analogous to 
(108) but with the following substitutions: 


2(£/ — 2) sinf — 2icosf for (—a’/k)V; 


2(#o’ + &) sinf + m cos f for (—a’/k)V, (110) 


2%.’ sin f + 2 cosf for (—a°/k)V; 
and fsg3h3 follow from (108) if we substitute 
(40°/k) Views for (—a°/k) Vewe- (111) 


Since the differential operators for all of these partial solutions are of 
the form 
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£” — Qn! — 3 
n” Qe (112) 
Cort, 
no explicit complementary solution of 0(e) is provided: i.e., the con- 
stants & --+ ¢9’ will be used to satisfy the i.c.’s to all levels of accuracy. 


These will differ from zero only if the nominal orbit is taken to differ 
from fp and vp at to. 

Specific results may now be obtained by the above formulas, which 
lend themselves to a geometric interpretation of perturbed satellite 
motions. For the oblateness effect one finds 


£ = (JR’/a)[—1 + sin’ 7(2 + 2 cos 26)] 

fp = (JR’/a)[(2 — 3 sin’ 2)f + a: sin’ 7 sin 26] (113) 

& = (JR’/2a) sin 2i[f cos 6 — 3 sin 6]. 
The terms in 26 reflect the doubly symmetric distortion of the orbit 
due to the oblateness of the gravitational field. The constant term in 
£, and the secular term in 7 reflect the additional mass of the equatorial 


bulge. Combining (113) with (104) to (106) into a complete solution, 
we observe that the constant term in € is 


Aa = (JR/a)[—1 + 2 sin’ @] + 2a’ + AE 
and the secular term in y (114) 
ako = f[(JR?/a)(2 — 8 sin® i) — 3( mo’ + 2)], 


which represent the differences between the nominal circular orbit 
and the mean circular orbit resulting from the perturbations. Since 
Aa = 0 and Aé@ = O do not yield linearly independent conditions for 
£ and mo’, we cannot effect a launch so that the radius and the mean 
angular rate coincide with the nominal ones (determined for a spherical 
earth) unless sin 7 = +/2/3. On the other hand, it turns out that we 
can preserve the nominal inclination of the orbit by choosing ¢) = 0 and 


to = (JR*/2a) sin 27 cos %. (115) 
Now, if we designate Af = [&l¢20", we find for the nodal regression 
a 2 
Q = Se ee = —nJ (2) cos 4 (116) 
sin 2 27a" a 


and this agrees with the well-known result. 
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In the case of drag perturbations one replaces V;, V,, Vz of (110) 
by the appropriate components of (11): 


Fr; =0 
F, = — (CyApo/2m){(k/a)*? — ca cos i](k/a)? (117) 
FPF, = — (CpApo/2m) o sin i(ka)? cos 6 
and finds 
& = (2a°/k)F,f 
in = (aF,/k)A — $f) (118) 
& = (a’F,/4k)[2f tan @ — cos 6]. 


Noting that 


= ae _ dt _ _ CyApoa’o sin a (119) 
Ta 


df 4mki 





we have agreement with standard results for the orbital precession 
due to diurnal winds. 
If we extend this drag analysis to 0(e) we find 


fo tfs (a'F,/2k)[% sin f — f cos f — 3f’ sin f] 

gx + gs = (a°F,/2k)[6f sin f + 9 cos f — 3f? cos f] 

hy + hs = [a’F,/(4k cos 0)][§ cos (@ + 2f) — 1 cosa — f sin & 
— fsin (@ + 2f)], 


(120) 


which are simple enough to permit a further extension to cases where 
py = p(f) is variable around the orbit. The details are straightforward.” 

It is of course understood that any of these results should be ac- 
companied by £m: if a general solution is desired. This, however, 
adds nothing to the characteristics of a particular perturbation. The 
formulas (104) to (106) and (108) to (111) can also be applied to a 
variety of other effects such as luni-solar perturbations and radiation 
pressure. 

The motivation behind the results of this section was to give a geo- 
metrically tangible account of perturbed satellite motions over a frac- 
tional period or just a few periods. This may be useful in various tar- 
geting, intercept, and rendezvous operations. On the other hand, the 
formulations of Sections ITT, 4.2, and 4.3 form the beginnings of ephem- 
eris computing techniques and orbit lifetime studies. These subjects 
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have been pursued further in several higher-level methods (see Section 
I), some of which deal partly with the elements and partly with co- 
ordinates and make occasional use of contact transformations. They 
may be considered a stepping stone to full-fledged astronomical pertur- 
bation analyses, about which there exists an extensive literature. 
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Methods of Orbit Refinement 


By R. B. BLACKMAN 
(Manuscript received January 20, 1964) 


During the past six or seven years, several methods of orbit refinement 
were developed specifically for use with artificial satellites and spacecraft. 
This article describes these methods, and the classical method, in a uniform 
mathematical formalism in order to facilitate comparisons of their relative 
advantages and disadvantages for practical systems applications. However, 
such comparisons are made in this article only to the extent that motivated 
the development of the new methods. 


I. INTRODUCTION 


The accuracy to which the position of a satellite or spacecraft can be 
predicted depends upon the accuracy to which the “initial” position and 
velocity vector, or related orbital parameters, are known. Since these 
parameters can be determined only from observational data which in- 
evitably contain observational errors, the accuracy to which they can be 
known depends upon the nature, the quantity, the accuracy, and the dis- 
tribution (in space and time) of the observational data, and the way in 
which these data are processed. The accuracy of the orbital parameters, 
and of prediction, depend also upon the accuracy to which all of the forces 
acting on the satellite or spacecraft are known and taken into account. 
Clearly, the term “‘accuracy”’ must be taken largely in a statistical sense. 

Orbit refinement is essentially data smoothing for the purpose of ac- 
curate prediction. Given the nature of the observational data, and the 
statistical properties of the observational errors, it is possible to formu- 
late a method of data smoothing and prediction which is optimum in the 
sense of giving predictions with the greatest possible accuracy. However, 
such an optimum method will, in general, be useful only asa standard of 
comparative performance for more practical methods. The reason for 
this is that it has not been difficult to find simpler and therefore more 
practical methods which are nearly as accurate as the optimum method. 
(For example, see Ref. 1.) 

It should be noted also that in the practical applications of data- 
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smoothing and prediction methods each application is usually character- 
ized by a unique set of practical constraints. Thus, it has been a frequent 
experience that a method which was judged to be the most practical for 
one application was usually not judged to be the most practical for an- 
other application. In fact, it has frequently happened that a new method, 
perhaps new only in the sense that it is a composite of parts of older 
methods, was developed for a particular system. 

The classical method of orbit refinement, the so-called ‘differential 
corrections’ method (which is essentially the method of least squares, 
developed by K. I’. Gauss in 1795) has served astronomers very well for 
over 150 years. However, this method becomes quite unwieldy for large 
quantities of observational data. Hence, the quantity of data to be proc- 
essed was frequently reduced by the supplementary use of “normal 
places” and/or “smoothing,” described briefly in Section 6.6.1, pp. 141- 
142 of Ref. 2, and later in this article. Thus, up to about 1955 no need 
was felt for another method of orbit refinement, although the basis for 
the development of alternative methods was implicit in a number of 
articles published in the field of general statistical analysis, such as Refs. 
3-6. 

With the development of artificial satellites and space probes, the 
need for alternative methods of orbit refinement began to be felt in 
some quarters. The first definite proposal of an alternative method, as 
far as the author is aware, was made by P. Swerling (Refs. 7-9). A some- 
what different method, independently developed by the author (Ref. 10) 
was used in the Telstar I experimental satellite communications system 
(Refs. 11, 12). Some difficulties experienced with this method after about 
four weeks of successful operation led A. J. Claus (Ref. 13) to develop 
another method which is slightly different from Swerling’s method. In 
addition to these methods, it is worthwhile to include a method of space 
navigation described by R. H. Battin (Ref. 14) because it involves a prac- 
tical detail which, under favorable circumstances, may be profitably 
introduced into the other methods. 

The essential details of these methods will be described here in a uni- 
form mathematical formalism in order to reveal their basic similarities 
and differences, and in order to facilitate comparisons of their relative 
advantages and disadvantages for practical systems applications. 


II. CLASSICAL DIFFERENTIAL CORRECTIONS METHOD. LEAST SQUARES 


Let ¢ be an n-rowed vector representation of the observational (angu- 
lar) data. Every component of this vector is assumed to be labeled to 
identify it as either a declination angle or a right ascension angle, and to 
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specify the time at which it was observed. Let e be the 6-rowed vector 
representation of a set of values of the orbital elements, and Iet ¢(e€) be 
an n-rowed vector representation of the angles which would have been 
observed, assuming that the actual orbital elements are exactly repre- 
sented by ¢«, and assuming that observations are made with ideal ac- 
curacy. If the observational errors are independently and normally dis- 
tributed, with zero means and equal variances, the best estimate of the 
orbital elements is that value of e« which minimizes the quadratic form 


Q = [6 — o(e)]’-[ — e(e)], 


where the prime stands for transposition. This quadratic form is simply 
the sum of the squares of the components of the vector difference 
& — v(e). 

Let e) be an initially assumed value for e, close to the true value. Then, 
to the first-order term in (€ — €0), 


g(e) = v(e) + J-(e — 0), (1) 
where J is the n by 6 matrix symbolized by 
J = d()/de. (2) 


Hence, to second-order terms, 
Ce aie lip J see), 
where r is the n-rowed vector residual 
=  — ge). (3) 
Now, Q is a minimum with respect to « if 
J’-[r — J+(e — e)|] = 0. 
Written in the form 
J’sI-(e— @) = dr, 


this corresponds to the set of 6 equations commonly called ‘normal equa- 
tions.” If J’-J is nonsingular, and if the value of ¢ which satisfies this 
equation is denoted by ~, then, 


€e= ab (IJ) dr, (4) 


This ¢is then substituted for ¢9 in (2), in (3), and in the right-hand mem- 
ber of (4), in order to obtain another @ This substitution procedure is 
iterated until @ has essentially converged. The final ¢ is the least squares 
estimate of the orbital elements. 
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In a statistical sense, this estimate is unbiased to the first order in the 
observational errors, assuming that the errors are not biased. It is only 
asymptotically unbiased to the second order, but it is a “consistent” 
estimate in the sense that the probability is unity that it will be correct 
to the second order as n — . (See Section 4.3 for clarification. ) 


2.1 Classical Method. Weighted Least Squares 


The method described in the preceding section is quite unwieldy for 
large values of n on account of the size of the J matrix. Some relief was 
obtained by resorting to various artifices. For example, if a number of 
observations were made at sufficiently short intervals of time, a straight 
average of these observations would be taken and treated as a single 
observation called a “normal place.”’ (More elaborate methods of deriv- 
ing normal places directly from the observational data are called 
“smoothing” by Baker and Makemson in Ref. 2.) Since normal places 
would be more accurate than single actual observations, in proportion 
to the number of actual observations which went into each of them, it 
was necessary to generalize the differential corrections method to some 
extent. 

The quadratic form to be minimized is now 


Q = [6 — o(e)]'-W- lo — o(e)], 
where TV is an n by n diagonal matrix. In expanded form, it is 
Q = do wile: — eile], 
where the w,;; are the components of W. Thus, the quadratic form is 
simply a weighted sum of the squares of the components of the vector 
difference ¢ — ¢(e). 

The analysis in this case will not be pursued beyond this point, since 
it is a special case of the analysis given in the next section. Suffice it to 
say that if the analysis were carried out for this special case, the results 
would be equivalent to the method used by astronomers when they deal 


with normal places, or with uncorrelated observations of different de- 
grees of accuracy. 


2.2 General Form of the Classical Method. 


If the observational data are not all of the same nature (angles, ranges, 
and range-rates), and especially if some of the errors in the data are cor- 
related, let 6 be the n by n covariance matrix of the n-rowed vector ¢. 
For further generality, let « be an m-rowed vector representation of a set 
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of values of parameters which, in addition to the 6 orbital elements, may 
include such quantities as the frequency of a satellite-borne Doppler 
source, and instrumental biases. Then the quadratic form to be mini- 
mized is 


Q = [6 — o(e)]'-&"-1o — 9(e)].- (5) 


Under the assumption that the components of ¢ obey a joint n-dimen- 
sional normal (i.c., Gaussian) distribution with covariance matrix ®, 
the value of e which minimizes the quadratic form (5) is the maximum 
likelihood estimate of the true value of the parameters. Under any other 
symmetrical probability distribution of the errors in the observational 
data, this value of ¢ is simply the weighted least squares estimate of the 
parameters. 
Substituting (1) into (5) we get 


Q = [r ~ J-(e— eo)! "fr — J-(e — ov)], (6) 


where J and r are defined by (2) and (3) except that J is now an n by 
m matrix, and e is an m-rowed vector. Now, Q is minimum with respect 
to eif 


J'-®" fr — J-(e — @)} = 0. 


Denoting the value of ¢ which satisfies this equation by é, we have 


€=eotC-p, (7) 

where 
C= (J'-87-J)™, an m by m matrix, (8) 
p = JB, an m-rowed vector. (9) 


Equation (7) is the generalization of (4) and, as in the case of (4), it 
is to be used iteratively until é has essentially converged. 

After ¢ has converged, C’ is its covariance matrix. This follows from 
the fact that (6) may be expressed in the form 


Q = (e — @)'-C'-(e — @) + terms independent of «. (10) 


In case the data are all of the same nature (all angles, or all ranges, or 
all range-rates), are all of the same accuracy, and the errors are not 
correlated, then, C = o°-(J’- J)". 

The availability of the covariance matrix C' of the estimate é offers the 
possibility of using the classical method in the intrapass stage of the pass- 
by-pass method described in Section IV, in order to reduce the amount 
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of observational data to be processed at any one time. However, it is not 
the only way to reduce the amount of observational data to be processed 
at any one time. There are other ways which, in particular, avoid the 
computation of the n by m matrix J, where n may be of the order of 400 
for each pass. One such way is Swerling’s method described in the next 
section. Another such way is cited as an example in Section 4.3. A modi- 
fied form of the second way is described briefly in Section V. 


Tl. SWERLING’S METITOD 


Let « be a 6-rowed vector estimate of the osculating orbital elements 
at epoch ¢,, and let C, be its covariance matrix. Let ¢ be the n-rowed 
vector of new observational data more or less centered at epoch te, 
where é > t,, and let ® be its covariance matrix. To obtain the least 
squares estimate of the osculating orbital elements at epoch é, , we must 
first update (i.e., extrapolate) e and C,. If & is the result of updating 
e,, the updated C; is 


C, = M-C,-M", 
where JM is the 6 by 6 matrix symbolized by 
M = 06/0 . 


Then, assuming that the errors in the new data are not correlated with 
the errors in the old data, the quadratic form to be minimized is 


Q = (€ —&)-Cr's(e — &) + [6 — o(e)!-B7 [6 — ee]. (11) 


This is essentially the sum of the right-hand members of (5) and (10). 
Now, to the first-order term in (¢ — &), 


o(e) = o(4) + J-(e — &), (12) 
where J is the n by 6 matrix symbolized by 


Then, to second-order terms, 
Q= (e— 4) Cr (e— &) +r —J-(e-&)/ 7 fr — J -(e — &)], 
where 
r= @-— ¢(&). (14) 
Now, Q is minimum with respect to e if 


Cr -(e = é1) —_— J/-@ fr = J -(eé art é1)] = (. 
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Denoting the value of « which satisfies this equation by é, we have 


e=& + C-p, (15) 
where 

C= [Cy¢ + JT sy, (16) 

p = JB -r, (17) 


Finally, since the quadratic form, to second-order terms, may be ex- 
pressed in the form 


Q = (e — é)’-C’*-(e — @) + terms independent of e¢, (18) 


it follows that C is the covariance matrix of é. Equations (11), (15), 
(16), and (17) correspond respectively to equations (16), (25), (19), 
and (23) in Swerling’s JAS paper (Ref. 8). Further, since e, and C; may 
have been computed at the preceding stage in exactly the same way as 
é and C at the last stage, (11) corresponds also to equation (30) in 
Swerling’s JAS paper. 

With regard to the updating of ¢, and C, it should be noted that esti- 
mates of “time of perigee (or nodal) passage’”’ should be labeled with 
the serial number of the passage. Thus, even in the hypothetical case of 
a purely Keplerian orbit, unless the serial number of the passage is in- 
tended to be the same in é as it is in e, , the vector & will differ from the 
vector «,. The component 7, (time of perigee or nodal passage) of 4 
will be increased to 7 in é , where 7, is 7; plus an integral multiple of 
the period estimate 27a;°”/Vu, where a; is the semimajoraxis component 
of «,. The matrix M will therefore be a unity matrix except for an off- 





diagonal component 07,/da, which is an integral multiple of 3aV ai/ HL. 
In this connection, A. J. Claus has found advantages in using the period 
instead of the semimajor axis as a component of e , especially when per- 
turbing forces are taken into account. 

In the classical method n must be at least equal to the number of 
orbital elements, and it must include all of the available observational 
data — old observational data (which has been processed at least once 
before) as well as new. In Swerling’s method n may be less than the num- 
ber of orbital elements (possibly n = 1), and old observational data are 
represented by the 6 components of e, , the 21 distinct components of the 
symmetrical matrix C; , and the epoch t, . The chief objection to Swerl- 
ing’s method, as far at least as some applications are concerned, is its 
inability to omit any part of the old observational data without reproc- 
essing the remainder of it. This objection is less serious if Swerling’s 
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method is used only in the intrapass stage of the pass-by-pass method 
described in the next section. 


IV. PASS-BY-PASS METHOD 


This method consists essentially of two stages per pass. (These stages 
are not quite the same as Swerling’s stages.) In the first (or intrapass) 
stage, a set of estimates of the orbital elements, and an associated co- 
variance matrix, are computed from the observational data for each pass. 
In this stage, any method of computation which yields a covariance 
matrix for the estimates of the orbital elements may be used. In the sec- 
ond (or interpass) stage, the sets of single-pass (i.e., intrapass) estimates 
of the orbital elements are combined cumulatively (and possibly selec- 
tively), by a method in which the single-pass (i.e., intrapass) covariance 
matrices play important roles. 

Let é and C, have the same significance as in the description of Swerl- 
ing’s method, but let the new data be processed separately to obtain an 
independent vector estimate e of the osculating orbital elements at 
epoch ¢, with covariance matrix C,. Then, the quadratic form to be 
minimized is 

Q = (e— a) Gr (e — &) +(e — @)'-Ce -(e — @). (19) 


This is the sum of two terms similar to the right-hand member of (10). 
Now, Q is minimum with respect to e if 


Git (6 — &) + Cr (e — &) = 0. 
Denoting the value of ¢ which satisfies this equation by é, we have 
€= C-(Cy'+& + C2 *+e) (20) 
where 
CSa(C a). (21) 
Finally, since (19) may be expressed in the form 
Q = (e — @)':C"+(e — @) + terms independent of ¢, (22) 


it follows that C is the covariance matrix of é. (See Appendix for a more 
illuminating derivation). 

Note that a ‘fading memory” can be introduced into the pass-by-pass 
method (or any other method in which the old data are represented by 
e, C,, and ¢,) simply by substituting kC, for C,, where k is a scalar 
constant (k > 1) or 


k = exp ly-(@ — &)] 
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where y is a constant (y > 0). However, at the cost of storing a number 
of single-pass estimates of the orbital elements and the associated co- 
variance matrices, old single-pass estimates of the orbital elements may 
be completely omitted at any time without having to reprocess any of 
the observational data on which the more recent single-pass estimates 
are based. 


4.1 An Hxample 


It is illuminating to see how the method of combination described in 
the preceding section works out in a simple problem. Consider an object 
traveling along the x axis at a known acceleration a. Let 


X11 vs 
q = eg = 
V1 4 Vo , 


be the estimates of position and velocity at epochs 4 and é , and let 


2 
C1 — C, — & °.]. 


2 
it ee | 
Las bs 


V1 + ar 


If b—- kh = Ty then 


whence 


, 0&1 = 1 T 
ee ae 
Then, 


e ox “te Toy TOy" 
C; — M-C,-M' a 2 2 . 


TOy Oy 
Note that the correlation coefficient, which is 


1 


is very close to unity for large values of 7. For this reason, or for other 
reasons, we may expect serious difficulties in numerical computations 
based on (20) and (21) as they stand. These difficulties are considerably 
reduced by the transformations described in the next section. 

In the example under consideration there is of course no difficulty 
about inverting matrices analytically. It is found finally that 
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€= A at epoch t., 


where 
at he 1 2 | 2 2 
v= Age ast {Oz [2(ay = 22) a rv = v2) | a ae 2 Xe} } 
p 1 2 2 2 
U= Rete ig? {202 (vy + v2 + ar) ae Oy T (Xe ices yar )}, 
and 
go Qe% + roy. Toy 
= 
4o,? + ro,’ Ty Qoy 


Under the assumptions implicit in C; and C2, that 2, v1, 2, and v2 
are not correlated, the equations for and 5 may be verified by a more 
familiar method. We have in fact two independent estimates of the 
position at epoch fy, viz., 


2.2 
V1 + U2 ‘ . 2 TO 
Y + Sige with variance o, + a : 





and 
~ . 2 
x. with variance o,. 


Taking the weighted average of these two estimates, each estimate being 
weighted in inverse proportion to its variance, we get #, as expressed 
above. The variance of Z is the harmonic mean of the two variances, so 
that 

2 2 2 2 
Or (Qor + Toy ) 


ed Raley a 


in agreement with the equation for C. 
Similarly, we have three independent estimates of the velocity at epoch 
to ; viz. ) 


. . 2 

v, + ar with variance o, , 
. : 2 

ve With variance a, , 


and 
2 


te — XY 1 . : 20 
eee emslees 5 or with variance —. 
T 


T 
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Taking the weighted average of these three estimates, cach estimate 
being weighted in inverse proportion to its variance, we get 0, as expressed 
above. The variance of 6 is the harmonic mean of the three variances, so 
that 


= 2.2 2 2.2 
var {3} = Jon 0s / (doz + Toy ); 
in agreement with the equation for C. 
The covariance of @ and 6 may also be verified, but the algebra is more 
involved. 
4.2 Transformation of fquattons 


Omitting the explicit notational reference to updating (1.c., extrapola- 
tion) in (20) and (21), we have 


é= C-(Cr +a + Cre), (23) 
where 


C 


(Page Og ee (24) 


These equations require the inversion of three matrices which, at least 
in the case of angles-only data, are usually extremely ill-conditioned. 
Hence, they have been transformed in order to reduce this difficulty. 
The transformations described here were developed by A. J. Claus and 
the author. 

Introducing the identity 


Croan = (Cr + Ce") +g — Cog 
into (23), we get 
€= @ — C-C2- (a — a). 
Now, 
C-Cyt = (Cy + Cy) Crt = [C2-(Cy* + C2)" 
(1+ C2-€y7)7 = (C1 + C2) Cr" 
ColtC Cs) 


Next, let 
P\ = S:C1:8, Py, = S:C2-8, (25) 


where S is a diagonal matrix in which each diagonal term is the reciprocal 
of the square root of the sum of the corresponding diagonal terms of 
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C, and C,. (Thus, every diagonal term of P; + P2 is unity.) Then, 
C1(Ci + C2) = S7-Pi- (Pi + P2) 8. 
Hence, 
@= 6 — S'-Py-(Pi + Pe) S:(a — e). 
Similarly, 


€9 + SPs (Py + P.)* + S+(e atl €). 


Next, let Wi and We be arbitrary square matrices whose sum is a unity 
matrix. Then, 


ée= Wie + Wa: € a R-(a Ss €2), 


E 


where 


R = (Wy-S7-Py — WS7-P2) (Pi + P2)7S. 


Since P,; + P, may yet be ill conditioned for inversion, we now use a 
well-known artifice, and write 


R = [Wy-S”-(Py-G) — We: S*+(Po-G)]-[(Pi + P2) Gl": S, 


where G may be regarded as another arbitrary square matrix although 
it merely represents a set of rules for combining rows and/or columns of 
P, + P2, as well as of P; and P; individually. [The normalization of the 
matrix sum C; + C, to ?; + Pe. and the preservation of its symmetry 
by the introduction of the matrix S, as in (25), simplifies the implementa- 
tion of the matrix G as a set of operational rules.) Finally, W; and W, 
are restricted to diagonal matrices, so that 


R= S°-[Wi-(Pi-G) — We: (Po-G)]-[(P1 + P2)-Gl"-S. (27) 


Noting that the right-hand member of (23) reduces to 2C if e is re- 
placed by C, and e is replaced by C2, it follows from (26) that 


C = 3[Wi-C, + WoC, — R-(C1, — C2)). (28) 
Tor further details, see Ref. 12. 


4.3 Debiasing Single-Pass Estimates 


Depending upon the method used to obtain single-pass estimates of 
the orbital elements in the first (or intrapass) stage of the pass-by-pass 
method of orbit refinement described in Section IV, the single-pass 
estimates may be biased on the average even if the errors in the observa- 
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tional data are not biased. Unless these biases are tolerable, they must 
of course be removed. In this section we will describe a method of remov- 
ing these biases in the single-pass estimates of the orbital elements. 
However, biases in the single-pass estimates of the orbital elements due 
to biased errors in the observational data will not be removed by this 
method. 

In the interest of simplicity, the description will be by analogy with 
a much simpler problem which involves only one observable coordinate, 
one parameter to be estimated, and does not involve time at all. We will 
consider two methods of estimation, of which the first is analogous to the 
classical method of orbit refinement, and the second is analogous to any 
method of obtaining single-pass estimates which are biased on the aver- 
age even if the errors in the observational data are not biased. 

Let y be a function of the observable coordinate x, and let 


Xz, = 1+ «€ (¢ = 1,2, ---,n) 


be the observed values of x, where the e’s are uncorrelated random er- 
rors with ave {e;} = Oand var {e,} = o” for every 7. The most direct way 
of estimating yo = y(2o) is obviously to compute first 


x= C25 23 


and then y(#). The nature of the estimate obtained in this way is deter- 
mined as follows. Since 


= %+ a, 


eS 


where 


a= (1/n) de, 
then, to second-order terms in the ¢,’s, 
y(®) = yo + aoa + $bvor, 
where ao = dyo/dx , and by = d’yo/dao. Now, 
ave {a} = 0, ave {a} = a'/n. 
Hence, to second-order terms in ¢, 
ave {y(£)} = yo + (boo'/2n), 
and 


var {y(£)} = ave {[y(#)]}"} — [ave {y(#)}f = aoo'/n. 
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If c’ is not known, an estimate of var {y(4)} is given by 


-\2 


var {y(Z) } Se) Dio) 
Since 


lim ave {y(X)} = yo, 


Nn>o 
the estimate is asymptotically unbiased; and sinee 


lim var {y(%)} = 0, 

the estimate is “consistent”? in the sense that the probability is unity 
that it will be correct to at least the second order as n — ©. These re- 
sults are indicative of the nature of the estimates of orbital elements 
obtained by the classical method of orbit refinement, in which the esti- 
mates are such as to “predict”? values of the observable coordinates 
which agree with the actual observations in the least squares sense. 

Now, consider another way of estimating yo. We compute y(2) for 
each observed value of x, and define the estimate of yo as 


fo = (1/n) J yi where y; = y(ai). 


The purpose of estimating 7 in this way is to permit the estimation of 
var {} without using ao. Thus, 


var {io} AT > R? 


where 
Rk; = Yi — Yo . 


The importance of this is that the analog of ao in the computation of or- 
bital elements from, say, two complete radar fixes (each fix consisting of 
a range and two angles) is the inverse of a 6 by 6 matrix whose compo- 
nents are functions of the epochs of the two fixes. The computation of 
this matrix may be avoided by computing a set of orbital elements from 
each pair of radar fixes, averaging over the sets, and estimating the co- 
variance matrix from the residuals. 

The nature of the estimate fj is determined as follows. Since, to sec- 
ond-order terms, 


Be oc 1 2 
Yi = Yo + doe; + aoe, ) 
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then 
Go = Yo + doa + FB, 


where a is as previously defined, and 


B = (1/n) Dd &. 
Now, 
ave {8} =o. 
Hence, 
ave {fo} = yo + (boo"/2), 
and 


var {Go} = ave {[Gol’} — [ave {G}]? = ava'/n. 


Thus, fj is a biased estimate of yo . In particular, it should be noted that, 
while the variance of this estimate decreases with increasing n, the bias 
in the estimate is independent of n. Hence, the variance bears no rela- 
tion to the accuracy of the estimate. 
The bias may be removed by the following supplementary procedure. 
1. Compute 4) such that 


y(to) = Jo. 


This is analogous to computing artificial tracking data (at the same 
epochs as the actual tracking data) from the biased single-pass estimates 
of the orbital elements analogous to 9% . The method of computing track- 
ing data from orbital elements must of course be numerically compatible 
with the method of computing orbital elements from tracking data in 
the absence of observational errors. 

2. Compute 


£; = 24) — x, 4=1,2,---,n. 


This is analogous to combining the actual tracking data with the arti- 
ficial tracking data computed in the preceding step. (The choice of a 
combination such that the random error in each £; is equal in magnitude 
but opposite in sign to the random error in the corresponding x; was 
suggested by D. R. Brillinger. ) 

3. Compute 


Yi = y(&,), UN dy eee 


This is exactly the same procedure used in computing y; = y(2;). 
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4. Compute 

Gi = 2(8yi — Ye), CS, eos 
5. Compute 

Go = (1/n) >-%; as the estimate of yo. 


The nature of the estimate % is determined as follows. To second-order 
terms, 

Pee ih ey Se a” 
2a 


a) 
Hence, 


re ee (ee eee (eee) ee 
ao 
Gi = yo — ales — 2a) + 2 — 4ae; + 2a? + 28), 


Gi = Yo + ap(2e; — a) + (ce; + 2ae; — a — 8), 


Jo = Yo + aow + 4oa’. 


Thus, to second-order terms in the ¢,’s, J is the same as y(Z). It is asymp- 
totically unbiased, and it is a consistent estimate of yo. It may be abso- 
lutely debiased, to second-order terms in the ¢,’s, by changing step 4 of 
the supplementary procedure to compute 


teil a oe ie 
Use A145 — us (1 +a]. 


Then, in determining the nature of the estimate 7 , we now have 





eee ao _ ane 
Leer a [(2n — 1) & — ne] 


= ny [(n — 1) ef + 2nae; — na’ — nl, 


Yo = Yo + Aga ++ 5 bo (na? — B). 


(n — 1) 
Hence, 


ave {fo} = yo. 
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Generally, if step 2 of the supplementary procedure is changed to 
compute 
4; = 3[(11 + w)to + (1 — w)ai, 
and step 4 is changed to compute 


§.= 70+ Wy + A — Wg, 





then, 

onl (l— W)bo fl —w , = te 
Do > Yon eas ey aes 4 + pp IF 
whence, 





ave {Ho} = yo+ 4 + —— + 


(1 — W)bdoo’ ? Sao. bap. Ae | 
4n 4 1-—Wwty 


Hence, the estimate 7 is asymptotically unbiased if 





w+ 7 
W= ae 
and is absolutely unbiased if 
v= nd 
WW Sas 


The choice of w should be made with some regard to the fact that 


#3) =<'|1- (n= 1) 140) Ga). 


ar {&; 
var {4 af 


The choice is w = 3 in step 2 of the supplementary procedure, and 
W = 2 instep 4, so that var {4,}) = o” = var {2}. 


V. CLAUS’S METHOD 


The pass-by-pass method described in Section IV was used at the An- 
dover, Maine, station of the Telstar I experimental satellite communica- 
tions system. The intrapass estimates of the orbital elements were 
computed from angles-only data by a method described in some detail 
in Refs. 11 and 12. Suffice it here to say that a set of orbital elements is 
computed from each set of four sightlines (of which there may be as 
many as 200 in a single pass), the sets of orbital elements are averaged, 
and an estimate of the covariance matrix is computed from the residuals. 


902 THE BELL SYSTEM TECHNICAL JOURNAL, MAY 1964 


(Compare this outline with that of the example cited in Section 4.3, which 
uses complete radar fixes, including range. It should be noted that unless 
the number of sets of orbital elements is at least equal to the number of 
orbital elements the covariance matrix will be singular.) This method 
gave excellent results for about four weeks. After that period it began to 
give sporadically bad results — typical errors of 10° feet in single-pass 
estimates of the semimajor axis. 

Extensive simulation studies by W. C. Ridgway III showed that most 
of the trouble was probably due to the sensitivity of single-pass estimates 
of the orbital elements to a bias error in the elevation angles, where these 
estimates are derived from single-tracker angles-only data. This result 
was subsequently confirmed by some formal analysis by the author. 
Ridgway’s studies showed, in particular, that the sensitivity increases 
rapidly with decreasing length of pass, and with increasing maximum 
elevation angle, although sightlines at elevation angles over 82.5 degrees 
(or under 7.5 degrees) were not used. This is consistent with the fact 
that the sporadically bad results began to occur when the perigee of 
Telstar I had precessed sufficiently to make the passes at Andover sub- 
stantially shorter than they were during the first week, and a substan- 
tially larger proportion of the passes had high maximum elevation angles. 
Ridgway’s studies showed that, under these conditions, an error of 105 
feet in the single-pass estimate of the semimajor axis could easily be due 
to a bias of 0.01 degree in the elevation angles. 

In order to overcome the sensitivity of a single-pass single-tracker 
angles-only method to a bias error in the elevation angles, on short passes 
with high maximum elevation angles, A. J. Claus has developed a method 
of orbit refinement which permits the use of a few, perhaps only one or 
two, measurements of range in each pass. This method was intended 
to be used in the intrapass stage of the pass-by-pass method described in 
Section IV, but it may be used as a self-sufficient method, just as Swerl- 
ing’s method may be used either in the intrapass stage of the pass-by- 
pass method or as a self-sufficient method. 

Although Claus developed his method with no foreknowledge of 
Swerling’s method, it turns out that his method differs from Swerling’s 
method only in the explicit introduction of an iterative routine which, 
as in the classical method, improves the estimates of the orbital elements. 

Instead of (12) we now substitute (1) into (11) so that 


Q = (e— &)' Cy (e — &) 
+ [r— J-(e— eo) |’: [r — J-(e— e)], 
where J and r are defined by (2) and (3), and e) has the same signifi- 


(29) 
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cance as in the classical method. Now, Q is minimum with respect to «€ 
if 
Cr (e—&) — J’ -@ 7" [r — J-(e — «)] = 0. 


Denoting the value of ¢ which satisfies this equation by é, we have 


€= C-(O,7-& + J’ Je + ol, (30) 

where 
Cal Ie dr. (31) 
p = JP -r, (32) 


Equation (30) may be expressed in the form 
€=44+C-[J'-@7-J+(e — &) + ol, (33) 


where C, J, and p (the latter through r as well as J) depend implicitly 
on e9. Comparing this with (15), it will be seen that the iterative use of 
(30) or (33), substituting ¢ for ¢ at each iteration, until the difference 
between e and € is negligible, is precluded in Swerling’s method by the 
constraint 6) = &. 

Finally, since (29) may be expressed in the form 


Q = (e — &/-C™(e — @) + terms independent of «, (34) 


it follows that C is the covariance matrix of é. 


VI. BATTIN’S METHOD 


This method, as described in Ref. 14, is essentially a special case of 
Swerling’s method. However, by introducing new data one at a time, 
Battin’s method avoids the inversion of matrices. (Swerling’s JAS article 
contains equations whereby matrix inversions are avoided if new data 
are introduced one at a time, but these equations appear at the end of the 
section entitled “Statistics of Propagated Errors,’’ and their use for 
avoiding matrix inversions is not explicitly stated.) 

In the notation of Section III, (15) may be written in the form 


é= & + [ J’: J + al Oil Maer kee 2 (35) 


Q- ‘ ss . . . ° 
where o’ is the variance of the scalar ¢, J is a 1 by 6 matrix (i.e., J’ isa 
6-rowed vector), and r is a scalar. Now, if 


a=J-Cy J’ +o (ascalar), (36) 


then, 
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i = pp Lav! + a 


a 


Sd SC + ola’, 


Ly aed + #0" | Crd : (37) 
Substituting (37) for the last J’ in (35), we get 
e= a+ GJ’. (38) 
Further, since (16) may be written in the form 
CH0ol J+ eC), 
then, 
C-G =e J+ et y'- Gy, 

and, therefore, 

[esd toe Gy (6 — G) = —d IG. 
Substituting (37) for J’ in the right-hand member, we get 


A 


C= 6,-——G.J'-J-C). (39) 


Ql 


Equations (36), (38), and (39) are equivalent to equations (80), (29), 
and (33) in Battin’s paper (Ref. 14). 

From this description of Battin’s method, it is evident that the in- 
version of matrices may be avoided also in Claus’s method if new data 
(perhaps only the range data) are introduced one at a time. Comparing 
(33), (81), and (32) with (15), (16), and (17), it is clear that (38) and 
(39), with (36), are valid in Claus’s method if 


r= @-— (6) + J-(e— &), (40) 
and 
t= 0¢(€0) /de0. (41) 


The initial value of e) may be taken equal to é, but thereafter é is re- 
peatedly substituted for « until the difference between é and ¢p is negli- 
gible. 
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VII. MOD-S AND MOD-C METHODS 


Swerling’s method and Claus’s method can be modified to permit the 
introduction of new data n at a time, where n < 6, at the cost of having 
to invert a matrix of order n. This might be worthwhile for n = 2 or 3. 

With regard to Swerling’s method, let 


A=J-6,-J'’ +6 (an n by n matrix). (42) 
Then, 
Je = Jot [Oy J! + e]-AT 
[oT + CG AT, 


Substituting this into (17), substituting the resultant expression for p 
into (15), and taking account of (16), we get 


e=& t+ Cy J Aer, (44) 


(43) 


Equation (36) is a special case of (42), (38) is a special case of (44), and 
(39) is a special case of 


C = C1 — C,-J'-AT SCY . (45) 


With regard to Claus’s method, (42), (44), and (45) are valid if r 
and J are defined by (40) and (41), and the repeated substitution of 
é for ¢ is carried out until the difference between e and é is negligible. 


VIII. SUMMARY AND CLOSING REMARKS 


8.1 Summary 


The classical “differential corrections” method of orbit refinement, 
occasionally supplemented by the use of “normal places,” is appropriate 
for astronomical bodies whose relative positions change very slowly, 
whose relative angular positions, viewed from the earth, can be measured 
with extreme optical precision, and which therefore require compara- 
tively small quantities of observational data to establish their orbits with 
great accuracy. However, that method is very unwieldy for artificial 
earth satellites or short-range space probes, where the relative inaccuracy 
of the observational data must be offset by greater quantities of observa- 
tional data. 

Tor artificial earth satellites or short-range space probes, Swerling’s 
method is more practical, chiefly because it does not require all of the 
observational data to be processed together. However, to omit any part 
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of the old observational data which has been processed by Swerling’s 
method it is necessary to start all over again in the sense that all of the 
old observational data which are to be retained must be reprocessed in 
the same way, along with any new observational data which might be 
available. 

The pass-by-pass method, on the other hand, operates on the princi- 
ple of computing an independent set of estimates of the orbital elements 
for each pass, and combining the sets of single-pass estimates in an opti- 
mum way. Thus, entire blocks of observational data may be omitted 
without actually reprocessing any of the old observational data from 
which single-pass estimates of the orbital elements have already been 
computed. The method of obtaining the single-pass (intrapass) estimates 
is optional. In the Telstar I experiments, it consisted in dividing up the 
data (no range data) into mutually interlaced independent sects of four 
sightlines, computing a sct of orbital elements from cach set of four 
sightlines, averaging over the sets of orbital elements, and computing a 
covariance matrix for the average set, from the residuals. This particular 
intrapass method introduces biases (apart from the biases in the data) 
and a method of eliminating these computational biases was developed 
but was not used in the Telstar I experiments. After four weeks of suc- 
cessful operation, a more serious source of trouble arose, which was 
traced to the increasing sensitivity to bias (residual boresight error and 
sample bias) in the elevation angle data. This inereasing sensitivity was 
associated with the precession of perigee to latitudes close to that of the 
tracker. 

To overcome the sensitivity of the angles-only intrapass method to 
bias in the elevation angle data, Claus developed a method, intended to 
be used chiefly in the intrapass stage of the pass-by-pass method for 
Telstar II, which could accept occasional range data. This method is 
essentially Swerling’s method supplemented by an iterative routine 
which improves its accuracy. 

In Swerling’s or Claus’s method, the inversion of six-by-six or higher- 
order matrices can be avoided by borrowing a detail from Battin’s 
method of spacecraft navigation, provided that the observational data 
are processed only one at a time, as in Battin’s method. However, at the 
cost of inverting n-by-n matrices, where n < 6 (say, 2 or 3), the observa- 
tional data may be processed n at a time, by modified forms of Swerling’s 
or Claus’s method. 

The comparisons of the methods described in this paper have been 
made only to the extent that motivated the development of the newer 
methods. The practical details of these methods are interchangeable to 
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a large extent, so that different and more appropriate combinations of 
these details may be made for other specific practical applications. 


8.2 Closing Remarks 


Claus has pointed out that, in principle, none of the alternative meth- 
ods described in this paper, including the use of normal places in the 
classical method, can be as efficient, in a statistical average sense, as the 
classical method without normal places. The reason for this is simply 
that the classical method without normal places allows the maximum 
possible freedom in fitting the estimates of the orbital elements to the 
observational data. Hence, the choice of a method, or combination of 
methods, for a particular application, usually involves a small sacrifice 
in accuracy. 
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APPENDIX 


Equations (20) and (21) may be derived in another way (essentially 
that used by Aitken in Ref. 3) which does not depend upon the minimiza- 
tion of a quadratic form and provides further insight into the signifi- 
cance of these equations. 

Let x be the true value of an n-rowed vector (one-column matrix). 
Let % be an unbiased estimate of x, with 


ave {( — x)-( — x)'} = C, 


where the prime stands for transposition and ‘‘ave” stands for ensemble 
average. The nth-order square matrix C is the covariance matrix of %. 
Let @ be another unbiased estimate of x, with 


ave {(4 — x) :-(4 — x)" = C, 
and let it be assumed that % and ¢ are independent, so that 
ave {(Z — x)-(£— 2x)} =0. 
Now, consider the weighted linear average 


e=Wttw-s (46) 
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where W and W are nth-order square matrices, with 
W+wWe=l (47) 
where J is the nth-order unit matrix. Since 
$-«v=W-(%-—2) + W(4—--2), 

it readily follows that if 

C = ave {(% — 2)-(& — x)" 
then 

C=WOW +LW-CW. (48) 
Each of the diagonal elements of C, viz., 


Cu = 2d (Wis Cine Wa + Wiz: CineWar) 
Jy 


will be minimized under constraints on Wa, + Wis by minimizing 
‘ Ci; — 2 Z Lar (Wie + W ix) 
where the \,,’s are Lagrange multipliers. This requires that 
Dd Wiz Ca = Vee, DS Wis Cn = de, 
J j 


for every 7, k. In matrix notation, 

WC=r, WC=r 
where X is the nth-order square matrix of the ),’s. Hence, 
W=rC7, W=r-C4, (49) 
where, to satisfy (47), 
= (07% + 67)". (50) 


~ 


By (48) and (49) 
n-(O7 + ON, 
’ by (50), 


Q 
ll 


but, since \ is a symmetrical matrix, 
C=. (51) 
Finally, by (46), (49), and (51), 
& = C-(C71-% + 67-4), (52) 
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where, by (50) and (51), 


Ca(C ee). (53) 


This derivation shows that the diagonal elements of C, which are the 


variances of the components of %, have been minimized independently 
of one another. 
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On the Response of Nonlinear Control 
Systems to Periodic Input Signals 


By IL. W. SANDBERG 


(Manuscript received June 20, 1963) 


In this paper we study a broad class of nonlinear control systems con- 
taining a single memoryless nonlinear element. We present conditions 
under which there exists a unique periodic response, with a given period, 
to an arbitrary periodic input with the same period, and we derive an upper 
bound on the mean-square error incurred by applying the describing- 
function technique. The expression for the error reflects the inturtive en- 
gineering arguments that are often employed to justify the use of the de- 
scribing-function method. Conditions are also presented under which 
subharmonic response components and self-sustained oscillations cannot 
occur. 


I. INTRODUCTION 


The describing-function technique is often used to determine the 
response of nonlinear control systems to sinusoidal input signals. In 
this approach,* which is applicable to systems of any order but which 
is ordinarily restricted to systems containing only one nonlinear ele- 
ment, it is assumed that the response is periodic, with only the com- 
ponent at the input frequency significant. 

Although the describing-function technique is of considerable prac- 
tical value and indeed is one of the most powerful analytical tools 
available to the control system synthesist, it appears that, except with 
regard to predicting the existence of self-sustained oscillations,* there 
has been no rigorous discussion of its validity. 

In this paper we study a broad class of nonlinear control systems 
containing a single memoryless nonlinear element. We present condi- 
tions under which there exists a unique periodic response, with a given 
period, to an arbitrary periodic input with the same period, and we 

* The describing-function technique was discovered independently by engineers 


in at least five different countries.1"* 
{| However, some interesting relevant ideas have been presented by Johnson.’ 
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derive an upper bound on the mean-square error incurred by applying 
the describing-function technique. The expression for the error reflects 
the intuitive engineering arguments that are often employed to justify 
the use of the describing-function method. Conditions are also presented 
under which subharmonic response components and _ self-sustained 
oscillations cannot occur. 

Some mathematical preliminaries are considered in Section II. In 
Section III we describe the physical system to be studied, introduce 
some assumptions and notation, and discuss the describing-function 
technique. The remaining sections are concerned with mathematical 
results relating to the functional equation that governs the behavior 
of the physical system. 


Il. MATHEMATICAL PRELIMINARIES 


Let ® = [0,p] be an arbitrary metric space.” A mapping A of the 
space ® into itself is said to be a contraction if there exists a number 
k < 1 such that 


p(Az,Ay) S kp(2,y) 


for any two elements 2,y ¢ 8. The contraction-mapping fixed-point 
theorem® is basic to much of the subsequent discussion. It states that 
every contraction-mapping defined in a complete metric space ® has 
one and only one fixed point (i.e., there exists a unique element z ¢ 0 
such that Az = z). Furthermore z = lim A”2, where 2» is an arbitrary 


element of 0. 

Let 7 be a real positive constant. The space of real-valued periodic 
functions of ¢ with period 7 which are square-integrable over a period 
is denoted by K. The norm of g ¢ & is denoted by || g || and is defined by 


1 


alice A gat 


(i.e., || g |] is the rms value of g). With this norm X is a Banach space. 
Ifig eK, 


N 
g=lim. >> gne’”* 


Noo n=—N 


where w) = 27/7 and the Fourier coefficients g, are given by 


1 T ee 
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Parseval’s identity reads: 
> lo = Ig Ie. 


Two elements of ®, g and h, are equivalent if || g — h || = 0. 

In accordance with the usual notation, the norm of a linear operator 
Q defined on & is denoted by |] Q ||. 

The symbol £:, denotes the space of real-valued absolutely integra- 
ble functions defined on the real interval (—«©,«). We take as the 
definition of the Fourier transform of f(t) € Liz : 


Fay = i y(t)e"at, 
The symbol I is used throughout to denote the identity operator. 


Ill, DESCRIPTION OF THE PHYSICAL SYSTEM, THE DESCRIBING-FUNCTION 
TECHNIQUE, AND THE PROJECTION OPERATOR P 


We shall be concerned with the familiar nonlinear control system 
shown in Fig. 1. 
Assumption I: It is assumed throughout that F (in Fig. 1) ts a linear 
operator. Let § = { ---, F»2,P1,Fo,f1, 2, -+-} denote a countable 
set of complex constants such that sup | F,| < © and F,, is equal to the 


complex conjugate of F_, . Unless stated otherwise, it ts assumed that the 
restriction of F to K 1s a bounded linear mapping of K tnto itself with the 
property that if g eK andh = Fg, thenh, = Fig, in which g, and h, , 
respectively, are the nth Fourier coefficients of g and h. (According to the 
Riesz-Fischer theorem, F is completely defined on ® by &.) 

The class of operators consistent with Assumption I includes the 
important special case in which 


Fg = [10 -aG@ar, geKk 


where f(t) € Liz (see Appendix A). Here F,, = F(inw.) where F'(iw) 
is the Fourier transform of f(¢). 


(eee eerste 


I 


Fig. 1 — Nonlinear control system. 
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Assumption II: The nonlinear function py in Fig. 1, which introduces 
the constraint v(t) = Ylw(t)], 7s assumed throughout to be real-valued, 
independent of t, and such that there exist two real constants a and B (8 > 0) 
with the properties that 3(a + B) = 1 and 


a(n — we) S Wl) — We) S Bur — we) 
for any real pw, = po. 


The normalization $(@ + 8) = 1 permits some simplification in the 
subsequent statements of results. 

In Fig. 1 the input signal y is related to the output signal « by the 
functional equation 


x= Fylx + y}. 


3.1 The Sinusoidal Response of the System in Fig. 1 


The response of the system in Fig. 1 to a sinusoidal input is frequently 
of engineering interest. Typically the system is of high order* so that 
the well-known special techniques applicable to second-order systems 
cannot be used. The describing-function approach simplifies the prob- 
lem by assuming that the output is periodic and that the only signifi- 
cant frequency component of the output is that component at the 
input frequency. Hence it is assumed that the input to the nonlinear 
device is a sinusoid and that y is characterized by the ratio of the funda- 
mental component of its output to the amplitude of the sinusoidal 
input (this ratio is called the describing function for y). Thus the non- 
linear element is treated as an element with a gain that varies with 
input signal level, and to the extent that the describing-function ap- 
proximation (sometimes called the ‘first harmonic approximation’’) 
is valid, the usual frequency response methods can be employed. 

The first harmonic approximation is often ‘“‘justified” on three grounds: 
first, no significant subharmonic components of x(t) are ordinarily 
present; second, the harmonics of the output of WY are ordinarily of 
smaller amplitude than the fundamental and, third, in most feedback 
systems F behaves as a low-pass filter with the result that the higher 
harmonics are significantly attentuated. 

Aside from at least two computational difficulties’ associated with 
the describing-function method, which can be remedied to a consid- 
erable extent with machine aids, “The third and most basie difficulty 
is related to the inaccuracy of the method and, in particular, to the 


* That is, the nonlinear differential equation governing the system is typically 
of high order. 
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uncertainty throughout the analysis about the accuracy. There is 
[in the literature] no simple method for evaluating the accuracy of 
the describing-function analysis of a nonlinear system and no definite 
assurance that the results derived with the describing function are even 
approximately correct.” However, it should not be inferred that the 
accuracy is necessarily poor.’” “Indeed the correlation between ex- 
perimental and theoretical results is in many cases better than the 
accuracy of the design data.’’” 


3.2 The Role of the Projection Operator P 


A moment’s reflection will show that the deseribing-function tech- 
nique as applied to the system in Fig. 1 amounts to analyzing the 
approximating system that results by replacing the operator F with the 
operator F defined by 


T 
7 i [Fogle "°o'dt = Fign, n= +1 
0 


= 0, n ~ Al 


where g € &, g, 18 the nth Fourier coefficient of g and T = 27/oy is the 
period of the input sinusoid. 

At this point it is convenient to introduce 
Definition I: Let 9 denote a set of integers such that —m ¢ N if m € N. 
Let g be an arbitrary element of K with nth Fourier coefficient g,, . 
The projection operator P ts a linear mapping of K into itself defined by 


T 
a [Pyle *"°"'dt = gn, meN 
0 


= 0, ng. 


An obvious generalization of the describing-function technique is to 
take as the approximating system the system that is obtained by re- 
placing F in Fig. 1 with PF. The results to be presented relate to this 
more general situation. Of course in the case of principal interest, 
9 = {—1,1} and PF = F. 


IV. RESULTS RELATING TO THE FUNCTIONAL EQUATION & = Fy[x + y] 


The proof of the following simple preliminary result is given in Ap- 
pendix B. 
Theorem I: 


F | = sup | Fy 
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Tf inf |1 — F,| > 0, the operator (I — F) possesses a bounded inverse 


on & and 





| (I — F) 'F || = sup 





F,, | 
1 — F,\° 


The principal result of this section is 
Theorem II: Let F, ¥, and B be as defined in Section III. Let y € K. 
Suppose that 





r = sup 
n 


Then there exists a unique x € K® such that « = Fy[x + yj. In fact, « = 
lim xm where 


M>o 


Umq1 = (I — F)F{ [2m +y) - a 


and x ts an arbitrary element of K. The mth approximation xm satisfies 


tm — 2) Sp a — ao 





Proof: 
Let ¥[w] = Yow + fw], where Yo is a real constant such that 
inf |1 — Yl", | > 0 (since r < 1, there exists such a yo). According to 


Theorem I, (I — yoF) possesses a bounded inverse on ®. Hence the 
functional equation « = Fy[« + y] can be written as « = Mz where 


Mex = (I — WF) Fee + y) + yo(I — F) Fy. 


In order to prove Theorem II it is sufficient to consider the case in 
which Yo = 1. However, we prefer to bring out the fact that this choice 
of Yo , the median of a and 8, Is optimal in a significant sense. 

It is evident that M is a mapping of & into itself. Let us consider 
the determination of a condition under which M is in fact a contraction- 
mapping of & into itself. Let g,h « K and observe that 


| Mg — Mh || = (1 — oF) F{vlg + y] — Oth + yl} | 
< (1 — woF)F\lll¥lg + yl — oth + yl. 
Since 
Io +l — dhe = | (MB _ ,) Gal, 


lA 


| Yio + yl — WA + yl S nod |g — All 
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where 


IIA 
— 


n(yo) = B — wo, Wo 
=  — a, Y 2 1. 
Thus 
|| Mg — Mh || S$ (1 — WF) F || n(do) [lg — AIL, 
and M is a contraction if 
q(vo) = [\(I — oF) “F || n(o) < 1. 
Using Theorem [, 


qo) = sup n(yo). 








F,, 
1 — WoFn 


Assuming that there exists a Y such that q(wo) < 1, the following 
result, which is proved in Ref. 11, implies that inf g(¥o) = ¢(1). 
vo 


Lemma. I: Let & be a complex number and suppose that | £ — Wo |~'n(vo) < 1. 
Then 


lé — wo ln) 2 }E-— 1 'n(). 


From this point on we assume that y = 1 and we set g(1) = r. 
Thus the assumptions stated in Theorem II imply that M is a contrac- 
tion. In view of the contraction-mapping fixed-point theorem, this 
establishes the existence and uniqueness of the function x(t) and the 
fact that it can be determined in accordance with the stated iteration 
procedure. 

The upper bound on || 2, — 2 || follows directly from the fact that 
x can be written as 


x= at Do [egan — 2, 


ws 
Oo 


in which, for all 7 = 1 
lua — 2 [| = ||] Mz; — Mag_y|| S rll 2; — xG-l. 
Remarks: 

Observe that a nontrivial self-sustained periodic oscillation with 
period T cannot exist in the system of Fig. 1 if the hypotheses of Theorem 
II are satisfied and ¥(0) = O (since then y = 0 implies that x = 0). 

When F is defined by 


Fo = [s(t — r)o(e)ar 


918 THE BELL SYSTEM TECHNICAL JOURNAL, MAY 1964 


where f(t) ¢€ Liz, Theorem II implies the following simple necessary 
condition for the occurrence of jump-resonance phenomena” in the 
system of Tig. 1: 


F Gw) 








ag {= £Go) (G1) 2-1. 
Tor example, let F(iw) = —k[(iw + a)(iw + 1)\ where k and a 
are positive constants. Let 
y[w] = 2w, |w| Se 
= 2c sign(w), |w|>c 


where c is a positive constant. Then it is a routine matter to show that 
jump-resonance phenomena can occur only if k > 4(1 + a’). 


4.1 Two Further Consequences of Theorem IT 
Corollary I: Suppose that the hypotheses of Theorem IT are satisfied. Then 


1 Z 
||| Ss = (I — F) ‘Fyly] |. 

Proof: 

Set m = 0 and ay = 0 in the upper bound for || x, — 2 |). 
Corollary IIT: Suppose that the hypotheses of Theorem II are satisfied and 
that & © K satisfies ¢ = PFy[¢ + y]. Then 

1 = ‘ 
|e - 4] S$; @-F) "FC — Pople + gl |. 

Proof: 

With m = 0 and a = 4, the upper bound for |] 2» — 2 || yields 


la —#|| < i | — F) CF {pl + y] — #} 


— (I — PF) PF {y[¢ + y] — 4}lL. 
Since (I — PF)" = (I — P) + (I—F) 'Pand (I — P)é# = 0, 
|e —4|| $+ 10 - FF — Pyle + a 


Remarks: 
Note that the hypotheses of Theorem II imply that there exists a 
unique ¢ ¢ K such that 4 = PFy[¢ + 4. 
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The bound on || « — # || can be expressed with the aid of Parseval’s 
identity as 
a\ 3 
) (1) 


where p, is the nth Fourier coefficient of yl + y]. Consider the usual 
describing-function case in which 9t = {—1,1}, and y is a sinusoid with 
period 7’. Assuming that y is an odd function so that »o = 0, (1) clearly 
shows that || « — 4 || is small when the amplitudes of the harmonics* 
of ¥[@ + y] are sufficiently small or when the attenuation of the har- 
monics by F is sufficiently large. Thus subject to the key inequality 


P, 


|e — #| to 





s7+,(2 


n eMC 











n 











sup 
n 


(1) makes precise the usual intuitive engineering arguments regarding 
the applicability of the describing-function method for determining the 
sinusoidal response of the system in Fig. 1. 

It can be shown” that if a = 0, ¥(0) = 0, and y = Py: 


| Pyle + wlll 2 ole + yl 
Under these conditions 
(I — Pyle + ylll = (ll vlé + yl — |] Pele + yl)! 
< (Pf —o’)' 4+ y[l 











and 
|z—#] s Ia - PFC = PIG —a«)|@+y|| 
1 F, Fitts 
Sep ee 1 Fe | (8° — a) | @ + y |. 





Under the conditions stated in Theorem II, the response z(t) can 
be determined in accordance with an iteration procedure for which the 
successive approximations converge in the mean-square sense at least a 
geometric rate. In particular, if we take x) = 4, the solution given by 
the describing-function method, the second approximation is 


a = (I—F) F{y[t + y] — 4}, 


* Of course here all even harmonic components vanish. 
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which can be evaluated in a relatively simple manner once ¢ has been 
determined. Using Theorem II and the expression for || «1 — 2o || ob- 
tained above, 


r 
1l—r 





(I — F) CF — P)vlé + yl]. 


|e -all Ss 


4.2 An Observation Relating to the Necessity of the Assumptions in The- 
orem II 


The basic assumption in Theorem II is: 


PF, 
1— Fr 








sup \(e —-1) <1. 
This inequality is satisfied if and only if the numbers (F,,)” are bounded 
away from the disk centered in the complex plane at (1,0) and having 
radius (8 — 1). It is of interest to note that there is a function y, in 
fact a linear y, that satisfies Assumption IT and possesses the property 
that there is no function «(t) ¢ K such that « = Fy[z + y] if, for some 
integer k: 
(1) F, #0 
(it) F;, is a point on the real-axis diameter of the disk mentioned 
above, and 

(iit) the kth Fourier coefficient of y does not vanish. 
To prove this assertion observe that if the three conditions are satisfied, 
a < (F,)” S B, and « = Fy[a + y] with pw] = (F:)"w, possesses 
no solution belonging to K. 


This observation suggests that the assumptions made in Theorem 
II are not too far from being necessary. 


4.3 On the Boundedness of the Solution 


Theorem III: Let F be defined by 


Fy = [ fi-s)gG)dr,  gex 


where f € Sir, and let p satisfy Assumption II. Let x e K satisfy x = 
Fy[z + y], y ¢ K. Suppose that (1 + |¢|)f(t) ts square-integrable on 
(— 0,0). Then | x(t)| 7s uniformly bounded on0 St S T. 
Proof: 

Let h = y[x + y] and note that h ¢ K. Using the Schwarz inequality, 
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h(t — r) 

coal ath 
“|h(t — 7) ; 
ears 


joo =| [a + Ir die) 





IA 


dr. 








[ta +lebs 


The last integral can be bounded as follows: 


=| h(t — 7) |’ 2 fOPhT TAC — 7) P 
d = 
[. 1 + 14+ fr] | as a: ie 1 + aes | 


29 (1 + (T)” = n*) [ | h(t) |? dr. 


Thus | z(¢)| is uniformly bounded on 0 S$ ¢ S 7. 
Remarks: 

The hypothesis regarding f(t) is almost always satisfied in cases of 
engineering interest. 

If the hypotheses of Theorem III are satisfied and | y(¢)| is uniformly 
bounded on [0,7], it follows that 2(¢) is continuous on [0,7'], since 
| A(£)| is uniformly bounded on [0,7] and 


dr 








[eta — f(t)| dt > 0.as 6 > 0. 


V. RESULTS RELATING TO AN IMPORTANT SPECIAL CLASS OF CONTROL 
SYSTEMS 


One frequently encounters”’’ discussions of systems of the type shown 
in Fig. 1 in which y is an odd function and the operator corresponding 
to F is characterized in the frequency domain by a transfer function 
(typically a real rational function in zw) with at least one pole at w = 0. 
The techniques presented earlier can be applied to situations of this 
type if F is replaced with F’, the restriction of F to the subspace* 


ro 
K' = (0 g EK; i g(t)e ***'dt = 0, n even} ; 


The operator F’ is completely defined on &’ by the set of complex 
numbers 3 = {---, F_3, F.1, F,, Fs, ---}. The result analogous to 
Theorem IT is 

Theorem IV: Let F’ and &’ be as defined above. Let y and B be as defined 
 *Ttisa simple matter to show that the linear manifold #’ is in fact a subspace 
of #. Consider any Cauchy sequence of elements of KH’. Since K is complete, the 


sequence converges to a function g belonging to K. However, a direct application 
of Parseval’s identity shows that this is impossible unless g € 
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in Section IIT, with the further qualification that ~(—p) = —wW() for 
any real p. Let y ¢ ®’. Suppose that 
FP 
= ui a : 
aes | Mae 





Then the conclusion of Theorem II remains valid if ® is replaced with 
K’, rts replaced with q, and F ts replaced with F’. 
Proof: 

The proof is essentially the same as for Theorem II. The assumption 
that y is odd is needed to verify that the operator corresponding to 
M (with y = 1) isa mapping of the Banach space &’ into itself.* 

Of course Theorem IV implies results entirely analogous to Corol- 
laries I and II. 


VI. ON THE LACK OF SUBHARMONIC COMPONENTS IN THE RESPONSE 


One of the key assumptions relating to the describing-function analy- 
sis of the system in Fig. 1 is that the response x(t) (assuming it exists) 
is a periodic function with period 7 when y(t) is a sinusoid with period 
T. In particular x(t) is assumed not to contain subharmonic compo- 
nents. The techniques described earlier can be used to obtain explicit 
conditions under which this assumption is valid. The following theorem 
contains one such result. A preliminary fact’® that is needed is: if f(t) 
€ Liz and F (iw) ¥ 1 for all real w, then there exists a function h(t) ¢ 
Liz With Fourier transform F(iw) [1 — F(iw)]™. 

Theorem V: Let » denote the space of bounded real-valued measurable 
functions defined on (—©,«). Let F be defined by 


Fy = [fi —aoledr, gen 


where f € Liz and let y and B be as defined in Section ITI. Suppose that 
F(iw) # 1 for all real w and that 


@-1) [ |Awla<1, 


where h(t) has Fourier transform F(iw) [1 — F(tw)|'. Let y € n. Then 
there exists a unique x € n such that x = Fy[x + yj. Further, x(t) 7s 
continuous and of y(t + T) = y(t), then a(t + T) = x(t). 


*If ge KR’, g(t) = —g(t + $T) for almost every ¢. Since y is odd, M preserves 
this property and hence M maps &’ into itself. 
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Proof: 

Arguments very similar to those presented in Ref. 13 can be used to 
show that under the conditions stated in the theorem, F is a bounded 
mapping of y into itself, that the operator (I — F) possesses a bounded 
inverse on 7 and that 


(I — F) Fg = / h(t — r)g(r)dr, g€%. 


Thus, paralleling the proof of Theorem II, the functional equation 
x = Fla + yj can be written as « = Lax where 


Le = (I — F)“F{vle + y] — al. 
Let the norm of an element of 7 be defined by 


Ig ll. = sup | g(¢) |, gen. 


With this norm 7 is a Banach space. It is a routine matter to verify 
that under the conditions stated in the theorem L is a contraction 
mapping of 7 into itself. This implies the existence of a unique solution 
u(t) € 7. 

The continuity of x(t) follows directly from the fact that « = Fy|a + y] 
in which y[x + y] is bounded and f ¢ Lir. 

If yt + 7) = y@), Lisa contraction mapping of the following sub- 
space of 7 into itself: {g |g en, g(@) = g(t + T)} and hence there exists a 
unique solution belonging to this subspace.* This completes the proof 
of Theorem V. 


VII. FINAL REMARKS 


It seems likely to this writer that the contraction-mapping fixed- 
point theorem, and more generally the techniques of functional analysis, 
can be exploited with considerable profit by the control system syn- 
thesist. Indeed one objective of this paper is to stimulate engineering 
interest in these techniques. 

The results in this paper can be extended to cover the analogous 
multiloop multi-nonlinear-element case (i.e., the case in which y, w, », 
and x in Fig. 1 are N-vector valued functions of ¢t, and y represents NV 
nonlinear elements of the type considered earlier). In particular, the 


* Alternatively, observe that x(¢ + 7’) is a solution of the functional equation 
when y(t + 7) = y(t). Since the solution is unique, x(¢ + 7) = x(¢). 
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corresponding extension of Theorem II is given in Theorem IV of Ref. 
14.* 

The writer is indebted to V. E. Benes and H. O. Pollak for reading 
the draft. 


APPENDIX A 


Let F be defined by 


= [He-DoG dr, gex 


where f(t) © Liz. 
We show first that F is a bounded mapping of & into itself. Consider 


nit) = [ft — rarer. 
Using Schwarz’s inequality, 


nO 1s flo — 2) bls) PF Par 


< Ca — 7) Fly) Lar) ([" Le) lar) 


from which 


[ino |'dt = a | [= bot — 71) P-L se) Lae | [@ |dr. (2) 


Since g has period T and fe Liz, 


ms | g(t — 7) Fat | | f(r) | dr = [ \g(t) |’ de [ | f(r) | dr O's 


Hence Fubini’s theorem implies that the order of integration in (2) can 
be interchanged. Thus 


r | a(t) Pdt < [ 1 g(t) Pat ([- ifr) Jar) 


and since A(t) is clearly real-valued and periodic in ¢ with period T, F is 
a bounded mapping of X into itself. 
Consider now the relation between the Fourier coefficients of h and g 
* In Theorem IV of Ref. 14, the basic functional equation is written in terms 
of what corresponds here to w (since x= w-— y, this is, of course, an unimportant 


difference), and the nonlinear functions are permitted to depend periodically on 
t with period T. 
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stated in Section ITI. We have 


r h(t)em"! dt = is [ [reat : “de | grineot gy 


By the same argument as before, the order of integration can be inter- 
changed. Thus 


i Alte", = a Ke she neat | t(r)dr 
s i if g(t — retort | oF) dr 


T 
3 Ghia / g(t) eeorgy 
0 


in which, clearly, sup | /'(inwo) | < o. 


APPENDIX B 


Proof of Theorem I 
The norm of F is: sup { || Fg ||; g ¢ &, || g || = 1}. Using Parseval’s 
identity, || Fg |’? = >> |¥, |: |g. /. Thus || Fl] S sup|F,|, and 
—00 n 
since it is clear that there existsa g ¢ K such that || g || = 1 and || Fg || 
= sup |/,,| — 6, where 6 is an arbitrary positive number, || F || = 
sup | F, |. 


Next, consider the invertibility of the operator (I — F) when 
inf|1— F, | > 0. Let w e ®. The hypothesis implies that 


D1 - Fal wal < @, 


where w, is the nth Fourier coefficient of w. Thus, according to the 
Riesz-Fischer theorem, there exists a z2 ¢ K with Fourier coefficients 
Z, = w,(1 — F,)~*. Parseval’s identity implies that z satisfies the 
equation (I — F)z = w (in the sense that || (I — F)z — w || = 0) and 
that 


|2]| S$ sup|1—F, 7° ||. 
Thus (I — F) possesses a bounded inverse on K. 


The expression for || (I — F)’F || given in Theorem I follows from 
arguments similar to those used to obtain the expression for || F |]. 
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Digital Computer Simulation of a Four- 
Phase Data Transmission System 


By M. A. RAPPEPORT 


(Manuscript received June 21, 1963) 


This paper discusses the performance of a four-phase data transmission 
system in the presence of delay distortion and impulse noise. The tool used 
in this investigation ts digital computer simulation, including a new tech- 
nique for introducing impulse noise. 

The noise studies indicate the usefulness of the eye aperture for measur- 
ing the degrading effects of delay distortion on such a four-phase system. 
Using the eye aperture as a criterion, the effects on performance of sinu- 
soidal, parabolic, quartic, and parabolically bounded sinusoidal delays are 
studied. Curves showing the resulting degradation in performance are 
obtained. These are uscd to find bounds on allowable delay for a certain 
allowable degradation for lines typical of equalized voice or group band- 
widths, usual nonequalized voice-bands, and loaded cable type voice-bands. 


I. INTRODUCTION 


1.1 Nature of Simulation and of the Telephone Plant 


Digital computer simulation can be a powerful tool in the study of 
data transmission systems. This paper begins with a discussion of the 
techniques that have proved useful in applying this tool to various 
data transmission systems, including a new technique for studying the 
effects of impulse noise. These methods are then used for a detailed 
simulation study of a four-phase data transmission system. 

Digital computer simulation, in general, aims at studying some 
physical system by a mathematical model of the system on a digital 
computer. In particular we are concerned with a real data transmission 
system in a real telephone plant. The scope of this paper is thus the 
performance of data systems over a time-stationary transmission path 
chosen at random from an ensemble of such lines. 

The major forms of interference which concern us are this time- 
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invariant transmission distortion and random additive impulse noise. 
Transmission distortion is characterized as nonuniform attenuation 
and/or nonlinear phase as a function of frequency across the transmis- 
sion band of the system. Nonlinear phase is commonly characterized in 
terms of nonuniform envelope delay, i.e., envelope delay distortion. 
Impulse noise is defined here as any randomly occurring voltage (or 
current) disturbances characterized by the occurrence of larger numbers 
of high peaks or pulses of noise than would be present in Gaussian 
noise of the same power, interspersed with long, low-power (‘‘quiet’’) 
periods. There are, of course, many other forms of disturbance in the 
telephone plant, ranging from line dropout and frequency offset to the 
special case of undersea cables where Gaussian background noise is the 
significant disturbance. While some of these factors are amenable to 
simulation, this paper is concerned only with the effects of transmission 
distortion and impulse noise. 


1.2 Reasons for Using Simulation 


The underlying reason for studying data transmission by digital 
computer simulation is the complexity of the data system itself. This 
includes the range and nature of the ensemble of possible telephone 
transmission facilities, difficulties both in specifying and working with 
impulsive type noises in closed form, and the analytic difficulties in 
investigating real modulators and demodulators. This complexity pre- 
sents problems both in analytic and experimental approaches. 

In the laboratory there are two major difficulties: first, obtaining 
insight into the basic workings of the system in an environment which 
is very difficult to control in the laboratory; second, obtaining sufficient 
flexibility to provide a controlled investigation of the wide range of 
conditions encountered in actual practice. 

Whereas in the experimental approach one of the difficulties is to 
see the forest for the trees, the analytical approach has the problem of 
having to clear away too many of the trees to make the forest visible 
at all. That is, the analytic approach often has to make a large number 
of assumptions about the performance of a real system in order to make 
the analysis tractable in closed form. For example, analytical approaches 
generally substitute the more easily handled Gaussian noise for the 
actually present impulse noise. 

It is an attempt to get the best of both worlds that leads one to simu- 
lation. One hopes that the environment and all the factors in it can be 
controlled, without forcing the investigator into too many simplifying 
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assumptions that may lead to misleading results. Because of this sort of 
hybrid approach, simulation has two major goals. First, insight is 
sought into how a system really works. Among other particular aims, 
one seeks the allowable assumptions that can be made in analyzing a 
system, the basic and secondary factors affecting system performance, 
possible new directions in design and so on. Second, simulation hopes to 
produce a catalog of the expected performance of a real system for a 
realistic range of transmission and noise environments. 


II. SCOPE OF FOUR-PHASE RESULTS 


Sections III and IV describe the simulation techniques used and the 
particular four-phase system considered. In this section the results 
obtained are briefly listed for purpose of reference. The results can be 
grouped in four classes: modem (modulator-demodulator) design, 
choice of criterion, numerical specification of performance for particular 
transmission conditions, and application of these numerical results to 
give general transmission design specifications. 

The modem design results obtained (Section 5.1) are on the modu- 
lation envelope shaping. In particular, the desirable amount of overlap 
between successive pulses is obtained. The same results were obtained 
simultaneously and independently in a laboratory test of the physical 
system by P. A. Baker,’ and thus are useful also as a check on the ac- 
curacy of the simulation. 

Section 5.2 presents results on the choice of a criterion for measur- 
ing performance of the system. The factors underlying choice of a 
criterion have been considered elsewhere.” The basic aim is to find a 
simple measure which will have the property of correlating with the 
performance of the system over a range of transmission facilities in the 
presence of impulse noise. It is shown that the aperture or opening of 
the “eye” pattern (see Fig. 7 of Section 5.2) is a reasonably satisfactory 
criterion. Therefore, the eye pattern is used for the presentation of 
results in the remainder of the paper. 

Whenever applicable, general transmission design specifications are 
intertwined with the specific numerical results. Section 6.2 presents 
numerical results on the distortion produced when delay can be defined 
as a sinusoidal function of frequency. These results are interpreted 
to show general transmission design requirements for group (Le., 
40-ke or greater) bandwidths. Section 6.3 gives numerical results for 
the distortion effect of a variety of other delay shapes, for example, 
parabolic delay, as a function of frequency. 
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Sections 6.4 and 6.5 consider two general design questions. The first 
is: given some means of specifying allowable delay, that is, some class 
of transmission lines, find the line producing maximum degradation in 
performance. With this information one can then give bounds on 
performance, if bounds on the delay are specified. For a wide class of 
classical delay specifications, the result is essentially a two-cycle sinu- 
soidal delay across the band. The next section considers the width 
of the band over which delay must be specified to predict system per- 
formance. It is shown that, for a system transmitting N bits per second, 
it is necessary to specify the delay over about 0.7N cycles of bandwidth, 
i.e., from the carrier ++0.35N cycles. 

Section 6.6 sums up the results on delay distortion to produce genera] 
transmission design specifications. The results are given in the form of 
curves of allowable delay vs maximum degradation. Three sets of curves 
are given to correspond to various shapes of delay in real facilities. 

The final section gives some results for attenuation distortion. These 
latter results are not intended to be a systematic presentation, but show 
the magnitude of the effects. 


III. SIMULATION TECHNIQUES 


3.1 Introduction 


This section considers those gencral simulation techniques appro- 
priate to the study of a data transmission system. Many of these tech- 
niques were introduced by R. A. Gibby.’ The main new approach is 
to introduce impulse noise to obtain the conditional probability of error 
given a noise of a specified nature present. The system is considered 
in blocks. For each block we attempt to duplicate mathematically the 
action the real system performs in shaping, or more generally in operat- 
ing on, an electrical signal. This is not an attempt to duplicate the 
action of a particular capacitor or resistor but rather to present mathe- 
matically the performance of an entire system block. Further, advan- 
tage is taken of characteristics of certain system blocks to materially 
simplify the real system. For example, blocks in sequence which are 
commutative can be reversed without affecting the simulation results. 

Fig. 1 shows two block diagrams of a data communications system. 
The upper figure is a common representation of a physical system. The 
lower figure 1s an equivalent model useful for simulation analysis. This 
equivalence is presented as an aid in understanding the nature of com- 
munication system simulation. Hereafter, we will call the upper figure 
the P (or physical) model and the lower figure the S model. 
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Fig. 1— Block diagram representations of communications systems: (a) 
Shvsteal (P) model, (b) simulation (S) model. 
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3.2 The Modulator and the Transmission Line 


The first box in the S model provides a catalog of all possible indi- 
vidual pulses used in the system. Since a digital computer is a diserete 
system, each pulse is defined by a sequence of sample values at a suc- 
cession of evenly spaced time points. lor a binary FSIX system, for 
example, such a catalog would be one pulse each at the basic ie 
and space) frequencies of the system. 

The next step in the S model is to pass each pulse through the trans- 
mission medium. We emphasize that the pulses are acted on by the 
medium before they are encoded into a pulse train. This is justified by 
the commutative properties of filtering (i.e., action by the transmis- 
sion medium) and adding together individual pulses displaced in time 
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(i.e., encoding of information). The third box in the 8 model then forms 
a pattern of pulses corresponding to the given pattern of encoded in- 
formation. 

The operation of the transmission facility on each pulse in the catalog 
is simulated in the frequency domain for reasons to be discussed shortly. 
Therefore, first a Fourier series of a particular pulse is formed in the 
computer. Next this Fourier series is modified by the attenuation and 
phase characteristics of the transmission facility. This operation will 
be discussed in greater detail in Section 5.8. The distorted pulses are 
then reformed in the time domain. These distorted pulses are then used 
to form a signal wave train corresponding to encoded information. 

While the legitimacy of reversing blocks 2 and 3 follows from the 
commutativity of the two blocks, it is not yet clear why we should 
want to do this in a simulation. The basic motivation is the tremendous 
increase in speed when a simulation is performed in this way. Instead 
of having to process a long train of pulses through the transmission 
medium, it is now necessary to process only the small catalog of pulses 
inherent in the modulation technique. The superposition of the string 
of pulses is then a simple operation (which would in any case have to 
be done, no matter what the sequence of the blocks). 

In addition, this approach gives a bonus of defining clearly individual 
pulses and their spectrums. We will see, in the section on sinusoidal 
delay distortion, the insight that is possible into performance of a system 
by the examination of individual pulses. 

It is worthwhile to consider also why the transmission medium char- 
acteristics are simulated in the frequency and not the time domain. 
The underlying reason is that most practical knowledge of the plant is 
presently known in frequency domain parameters. By substituting for 
a Fourier approach a convolution of the impulse response of the line 
with each of the possible input signal pulses, a simulation can be easily 
modified to operate in the time domain. The advantages of reversing 
the sequence of boxes 2 and 3 of the S model hold just as strongly, and 
for the same reasons, in the time domain as in the frequency domain. 

To make the greatest use of the long bit patterns available by simu- 
lating in this way, we desire a bit pattern that is representative of a 
random pulse train. What we actually use is what we will hereafter 
call a pseudo-random pulse train. For some integer h we desire to obtain 
every possible sequence of h bits the same number of times. For example, 
if h equals 3 there are eight possible combinations: 


000, 001, 010, 011, 100, 101, 110, 111. 


Any sequence in which each of these possible combinations of NV 
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bits occurs exactly the same number of times will have the same distri- 
bution as a random sequence for all bit sequences of length up to and 
including N. For example, a sequence where each of the above occurs 
exactly once is given by 


LTD O-0:.0 1-081. 1; 


‘ é a 7 4 
A discussion of generation of such sequences can be found in Peterson. 


3.3 The Demodulator 


Discussion of box 4, the introduction of noise into the simulation, 
will be postponed until later in this section. Box 5 of the S model repre- 
sents the demodulator of the data transmission system. The signal 
processing of the demodulator is simulated on the computer. The basic 
approach is to duplicate the action of each block of the demodulator 
on the sequence of amplitude samples representing the data line signal. 
The output of the demodulator in the S model is then a sequence of 
amplitude samples representing the restored but distorted data pulse 
train. 

Timing recovery in the 8 model is done in two separate steps. The 
first step, represented by box 6, is basically synchronization of the data 
train. The aim is to find the optimum sampling point relative to some 
criterion. Several criteria might be considered, but regardless of what 
criterion is used a synchronized or optimum timing point is obtained. 

The second step in timing, represented by box 7, is to introduce the 
timing jitter which would occur in a physical system. Thus this box 
represents the physical timing recovery error, both that due to the 
jitter inherent in the circuits themselves and the jitter due to distortion 
of the data signal. The effect is to duplicate the degradation in perform- 
ance due to imperfect timing. In keeping with methods used physically, 
the timing recovery signal can be generated either from the received 
line signal or the signal out of the demodulator. 


3.4 Notse Impairment — Performance Evaluation 


The last step in a simulation is to evaluate or measure the performance 
of the system. Some criterion of performance is chosen and is imple- 
mented in the simulation. For example, an eye pattern to measure 
system performance by the eye aperture might be formed. The eye 
pattern is formed by superimposing all possible three-bit intervals. In 
Fig. 7(a), a reference eye is shown. Fig. 7(b) shows some of the traces 
of a distorted waveform, and the resulting eye. The complete distorted 
eye would include all possible three-bit traces. The eye aperture is de- 
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fined as the minimax opening—that is, the maximum opening of the 
“worst”? bit pattern. Various coding schemes can be introduced at 
this point, and the improvement they produce evaluated according 
to the chosen criterion. 

We return now to discussion of box 4, the noise input, postponed in 
the previous presentation. Consider a noise burst of some particular 
size and shape introduced into a data system. We are interested in 
determining the conditional probability of error given this noise. In 
simulation the random occurrence of the noise pulse is replaced by the 
systematic introduction of a noise pulse at each of a large number of 
points in the data pulse train. As discussed above, the pulse train itself 
has each data sequence of a certain length, say 10 bits, occurring the 
same number of times; i.e., the possible sequences of N bits have uniform 
probability of occurrence. This pseudo-random data train is then mixed 
with noise by assuming the noise to be additive. Since a particular 
noise pulse is introduced at a very large (e.g., 4500) number of points, 
and uniformly along the data train, the effect is to introduce the noise 
randomly with uniform probability at all points in the data train. 

The major difficulty in using an approach of this type is to choose 
noise pulse shapes which are representative of the telephone plant. 
In the mathematical sense this problem is at present unsolved. However, 
despite this present restricted knowledge, useful results are obtained for 
the following reasons. First, there is a range of system transmission 
conditions of interest for which relative performance is reasonably 
invariant under a range of different noise shapes. That is, although the 
absolute performance (i.e., the conditional probability of error) of 
each system changes from noise shape to noise shape, the relative per- 
formance, or ranking, of the systems stays the same over this range 
of noise shapes. Thus the degradation introduced by a particular trans- 
mission medium can be measured. Second, it is quite possible to handle 
a fairly large number of noise shapes. For example, in the four-phase 
case nine noise shapes were handled in quite reasonable computer times. 
Thus performance may be catalogued for various kinds of noise pending 
further knowledge. Finally, this implies a third basic use, which is that 
even if the system does perform differently under different wave shapes, 
this very information is useful in indicating both the basic nature of 
and possible improvements in a system. 

The introduction of noise into the simulation can be used in various 
ways in measuring performance of the system. First, if the noise allows 
some ranking of transmission facilities, such as discussed in the para- 
graph above, it may be possible to directly correlate these rankings 
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with some criterion of performance of the system. This would then 
justify using this criterion in practice. For example, a justification for 
use of the eye aperture as a criterion for expected relative performance 
in four-phase transmission will be presented later. Second, it may be 
possible to use such a simulation in the design of better error detecting 
and correcting codes. That is, in the process of analyzing a long sequence 
of bits of a pseudo-random nature, it may appear that certain patterns 
of errors are more likely to occur in a realistic noise environment. Using 
such information, simpler error detecting and correcting codes on real 
lines might be obtainable. This is one approach to the construction of 
error codes on lines in which the memory is of a very complicated nature. 


IV. FOUR-PHASE SYSTEM 


4.1 Physical 


We come now to the four-phase system which will be our prime con- 
cern for the remainder of the paper. The physical system we consider 
has been described by P. A. Baker.’ The modulator of this system is 
shown in Fig. 2. Eight sine waves with the relative phases shown in the 
diagram are generated. These occur in two groups of four, as shown, 
and the data are encoded by choosing alternately from these two 
groups. The information is actually contained in the change in phase 


| : : =e 
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Fig. 2— Modulator. 
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between successive pulses. Since each pulse encodes two bits of infor- 
mation, it has been given the name ‘‘dibit.”” The two alternating channels 
are used to insure there is always a change in phase between successive 
dibits. From the diagram it can be seen that this change is always one of 
four odd multiples of 7/4. The actual shaping of the envelope of the 
sine waves is shown in the diagram. Just what this shaping should be 
to optimize performance was the first major check on the accuracy of 
the simulation. It will be considered in detail in Section 5.1. 

The demodulator of the system is shown in Fig. 3. The information 
is recovered by comparing successive dibits to determine the change in 
phase between them. For this purpose the modulator uses two quite 
similar channels. We consider in detail only the upper one of the figure. 
The incoming signal is delayed by an interval of 1 dibit and then multi- 
plied with the new incoming signal. This therefore results in a multi- 
plication of successive dibit intervals. The output of the multiplier is 
then integrated. The result will be either positive or negative, depending 
on the relative phase of the two successive dibits. A truth table to re- 
create the encoded information is shown on the figure. This truth table 
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Fig. 3 — Receiver. 
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also demonstrates the result of the lower channel, which differs from 
the upper channel only in that it uses an extra 3 cycle of delay. For 
greater detail the reader is referred to the paper by Baker.’ 


4.2 Simulation 


This section is concerned with the application of the general tech- 
niques of simulation to the specific four-phase system. The simulation 
processes a data input train of 512 dibits. This corresponds to 1024 bits 
of input information, or to all possible combinations of four successive 
dibits. An extension of the exhaustive pattern technique mentioned 
above had to be developed for this four-phase case. It is no longer 
sufficient to make a choice between 0’s and 1’s for each information 
time slot. In the first time slot it is possible to pick any one of the eight 
possible signal waves, as shown in Fig. 2. Due to the alternation be- 
tween the two channels of the modulator, it is possible to choose any 
one of four signals for each time slot after the first one. Thus, there are 
a total of (8 X 4 X 4 X 4 = 512) possible four-dibit combinations in 
the pseudo-random data train. For each sequence of four dibits there is 
another sequence different only in polarity; i.e., sequence A is simply the 
negative of sequence B. Since the multiplier eliminates this polarity 
difference, there is a seeming redundancy in using a pattern of this 
length. However, the noise introduced is of one polarity only. Thus this 
redundancy is necessary to obtain representative results with noise 
present. 

There are two channels shown in the demodulator of Fig. 3. Initially 
both channels were simulated. It seems intuitively reasonable that the 
over-all results in these two channels should be essentially the same. 
For example, the signal sequence of initial phases —90, 45 should, with 
the extra 90° delay, give approximately the same results in the lower 
channel as the signal sequence 0, 45 gives in the upper channel. This is 
true within the limitations of slightly differing end effects between the 
two possible data sequences. There is such an image in the lower channel 
for every data sequence in the upper channel. One of the first tests 
performed with the working simulation was to check this hypothesis of 
approximately the same over-all results from the two channels. Over a 
fairly wide range of cases considered, no substantial difference in the 
performance of the two channels on an over-all basis was found. There- 
fore, the results given in this paper were obtained using only the upper 
channel of Fig. 3. The results apply to either channel operating alone, 
or to the system operating with both channels. 

The results presented were obtained using nine basic noise wave- 
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shapes. These are shown in Fig. 4. In the first row are shown three 
sinusoidal pulses. These pulses differ in the ratio of their frequency to 
that of the dibit speed (or equivalently the carrier frequency) of the 
system. Similarly, the second row shows sinusoidal pulses subject to a 
one-sided exponential decay, and the third row shows two-sided ex- 
ponentially decayed sinusoids. Each noise was considered over a range 
of noise to signal amplitudes. This range will be described more fully 
in Section 5.2. The noises shown were picked for two reasons. First, 
they represent a fairly wide range of parameters and shapes. Second, 
impulses idealized in this way are suggested by experimental studies of 
the telephone plant such as that of J. H. Fennick.° 


V. BASIC SIMULATION RESULTS 


5.1 Stmulation Check — Envelope Shaping 


The first step in using a simulation must be to check its performance 
against the physical model it represents. In the case of four-phase 
simulation this first check was provided by an investigation of the 
optimum envelope shaping (in some particular class of functions) in 
the modulator. 
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Fig. 4 — Noise waveshapes; all shapes are shown prior to detection but sub- 
sequent to transmission. 
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Prior to the envelope shaping the basic four-phase pulse is given by 
f(t) = sin [weart + (2/4) ] n = 0,1,2,3,4,5,6,7 (1) 


where n represents the information content of the wave. Consider first 
a raised-cosine envelope shaping. Then a four-phase pulse is denoted by 
(upper indicates upper channel of modulator) 


Psu) os sill (rl + n *) E + 5 cos $ abit | n= 0,2,4,6 
: (2) 
fomal) = sm ( +n *) E _ 5 cos abit’ ee Bir 


A typical sequence of such pulses is shown in Fig. 5(a). However, as 
can be seen in Iigs. 2 or 5(a), this full raised-cosine envelope builds in an 
overlap between successive dibits. One alternative to such shaping is 
phase shift keying—that is, a jump from one phase to another at dibit 
intervals. However, since the transmission lines have finite bandwidth, 
such phase jumps would be distorted in transmission. To provide a 
controlled smooth transition, rather than the uncontrolled distortion 
resulting from the action of finite bandwidth on phase jumps, while 
minimizing the effect of the built-in overlap of the raised cosine, the 
basic functional form of the envelope given by (8) was considered 
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and similarly for the lower channel envelope. The parameter 7’ controls 
the amount of built-in overlap of the modulation envelope. The opti- 
mum value for 7’ agreed when found independently in a laboratory test 
of the working system by Baker and on the computer. This provided a 
check on the simulation. 

The optimum value of T is #. This corresponds to a modified raised 
cosine with about ¥ dibit overlap, as shown in Fig. 5(b) (that is, about 
50 per cent of the maximum overlap of Fig. 5a). The shaping of this 
modulation envelope in effect determines the amplitude spectrum of the 
line signal. In the course of subsequent investigation on the computer, 
another class of amplitude shaping was also considered. A typical mem- 
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Fig. 5 — Envelope shaping: (a) full raised-cosine shaping, (b) modified shaping 
used in present data terminals. 


ber of this class of squared-off raised-cosine amplitude spectra is shown 
in Fig. 6 along with the spectra resulting from a complete raised-cosine 
shaping (that is 7 = 1) and for the optimized T = 3. A sequence of 
values of the parameter b (see Fig. 6) of the squared-off raised cosine 
was considered. No value of b gave performance as good as the modified 
raised cosine with 7’ = 3. 


5.2 Criterion — Eye Pattern 


The measure of performance used in obtaining the results described 
in the above paragraph was the eye pattern (Tig. 7). However, while 
the accuracy of the simulation can be checked by comparing the eye 
obtained in a laboratory test to that obtained by the simulation, this 
does not in itself justify the eye as a suitable measure of performance 
for the four-phase data system. This justification of the eye depends in 
the final analysis on demonstrating a correlation between the expected 
performance of the system in a real noise environment and the opening 
or aperture of the eye. 

Impulse noises of nine different shapes were introduced into the 
simulation. Each of these was considered over a signal-to-noise ratio of 
about 14 db, from noise of +5 to —9 db relative to the signal. The 
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Fig. 6 — Envelopes of amplitude spectra. 
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_ ‘Fig. 7 — Eye patterns. (a) Undistorted eye: tracings result from patterns 

000, 001, 010, 011, 110, and 111; A is eye aperture normalized for undistorted eye 
to 1.0. (b) Distorted eye: figure shows some tracings; total eye formed from all 
possible three-bit intervals. 
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signal-to-noise ratio was defined as the peak undistorted signal amplitude 
relative to the peak noise amplitude. 

The arbitrary nature of this definition is not significant, since it docs 
not affect the ranking or relative performance of various lines. A sample 
of the many cases considered is shown in Fig. 8 for two noises and two 
transmission lines (one of which is back-to-back or reference trans- 
mission). As might be expected, all of the lines perform approximately 
the same at very high noise levels. At such levels the noise completely 
swamps out the signal, and the resulting performance is then the totally 
random effect of the noise alone. We therefore choose the range of con- 
ditional probability of error for which the results are meaningful in 
terms of degradation in performance. To avoid misunderstanding, we 
note that while performance is certainly sensitive to noise waveshape, 
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SIGNAL-TO-NOISE RATIO IN DECIBELS 


Fig. 8 — Conditional probability of error | Pyw(e)| vs S/N [db (peak undis- 
torted signal/peak noise)]. Shows results of two sample noises on a distorted 
and an undistorted transmission line. 
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much of the difference seen in Fig. 8 comes from defining S/N using 
the peak value, and not some average total power measure for the noise. 

The range of conditional probability of error chosen for these tests 
was between 0.005 and 0.125. For a particular transmission line and 
conditional probability of error, there is a range of degradation of 
performance as measured by the change in signal-to-noise ratio for 
various noise shapes. A sample of the noise results obtained is given in 
Table I for a range of transmission lines in the following manner. First 
the transmission line is described with a sufficient number of parameters 
to characterize it. or example, sinusoidal delay is described in terms of 
its amplitude (Bm), its frequency (m) and its phase 6 relative to the 
transmission band. A detailed discussion of each parametric representa- 
tion will be described later. Numerical results are then given for each 
of three levels of conditional probability of error, namely 0.005, 0.025, 
and 0.125. These results are the range in degradation and signal-to- 
noise ratio due to the nine noises considered for the given transmission 
line relative to back-to-back performance. The correlation in perform- 
ance between results obtained using conditional probability of error and 
the eye aperture can be seen by comparing the degradation in perform- 
ance measured by successively smaller P(e) (i.e., for successively smaller 
noises) with the degradation measured by the aperture shown in the 
last column. It is seen that the eye seems to give a limiting value. 

Figs. 11-13 of Section 6.2 are presented in terms of eye aperture. Pres- 
entation in this form is justified by the fact that, for the range of trans- 
mission lines and noise considered, the value of eye aperture correlates 
consistently with the degradation in performance obtained by keeping 
constant conditional probability of error. This justification is one of the 
principal results of this paper. While this does not demonstrate that the 
eye will be a good measure of performance for all possible distortions, 
the wide range of noises and transmission facilities considered indicates 
the eye to be an appropriate criterion for four-phase transmission, at 
least over lines whose principal distortion is nonuniform delay. 


5.3 Transmission Line Simulation 


A typical four-phase data pulse is given by the product of f(t) of 
(1) and envelope (¢) of (8). As mentioned previously, passing these 
pulses over a transmission line was done in the simulation in the fre- 
quency domain. Thus for each of these pulses a Fourier series was 
formed. A typical example of such a series is given in (5), where a, 
the fundamental frequency of the Fourier series, determines the spacing 
of the spectral lines which represent the wave: 
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TABLE I— RANGES OF CONDITIONAL PROBABILITY OF !2RROR 
All numbers are degradation in db relative to no distortion—range repre- 


sents variation over nine noises. 











Degradation (in db) Measured Using | Degradation 
Conditional Probability of Error (in db) 
Delay Measured 
Using 
P(e) = 0.125|/P(e) = 0.025] P(e) = 0.005) Aperture 
Sinusoidal 
Bm = 0.5, m=0.5, 0= 7/2 0-0.3 0.5-0.8 | 0.7-1.2 1.1 
1.5 0.5 0 1.4-1.8 3.1-3.7 | 3.8-4.5 4.9 
1.5 0.5 a /2 2.9-3.5 6.2-6.6 | 8.2-8.9 9.6 
1.0 1 0 1.3-1.7 3.1-3.8 | 4.2-5.0 4.8 
1.0 1 aw /2 2.8-3.7 6.4-7.0 | 8.6-9.3 10.2 
1.0 1.25 a/2 3.0-3.8 7.1-7.6 | 9.7-10.5 11.7 
0.5 1.5 0 0.6-1.0 1.8-2.2} 2.5-3.1 3.1 
1.0 1.5 0 3.9-4.4 8.5-8.8 | 11.4-11.7 12.8 
1.0 1.75 0 4.5-5.1 9.8-10.5 13-14.5) 15.8 
0.5 2.0 a/2 0.3-0.7 1.5-1.8 | 2.1-2.8 3.1 
1.0 2.0 w/2 2.0-2.7 5.6-6.3 | 8.1-9.0 10.1 
1.5 2:5 0 5.6-6.1 | 12.1-12.7| 17.5-19.0) 24.5 
Parabolic 
1.0 bit delay at wear + 0.35 wnit 0.3-0.4 1.1-1.2] 1.6-1.9 1.8 
1.5 bits delay at wear + 0.35 whit 0.8-1.0 2.1-2.5 | 3.3-3.4 3.7 
1.5 bits delay at wear + 0.275 wit 1.1-1.7 3.2-3.7 | 5.0-5.1 5.8 
Wear — 0.425 wnit 
1.5 bits delay at wear + 0.2 wnit 2.5-3.1 6.6-6.8 | 10.3-11.3) 12.4 
Wear — 0.5 whit 
Quartic 
1.25 bits delay at wear + 0.35 wnit 0.2-0.8 0.8-1.3 | 1.7-2.2 1.9 
1.25 bits delay at wear + 0.275 wit 0.7-1.3 2.5-2.7 | 3.5-4.0 4.8 
Wear — 0.425 Whit 
1.0 bit delay at wear + 0.2 whit 2.1-2.6 5.3-5.8 | 8.5-9.8 11.3 
Wear — 0.5 Whit 
Band cutoff cases 
Quadratic delay to wear + 0.35 wnit — | 0.6-0.9 1.4-1.7 | 2.3-2.5 2.5 
then delay uniform 
Delay of Fig. 22, curve a 2.0-2.8 4.6-5.0 | 7.8-8.4 8.8 
Delay of Fig. 22, curve c 1.1-1.6 3.6-3.8 | 5.2-5.9 6.3 
Delay of Fig. 22, curve f 1.0-1.2 2.6-2.8 | 3.5-4.0 4.4 
Attenuation cases 
Sinusoidal delay 
Bm =1.0, m=2.5, @=0 
6-db slope atten. of Fig. 9(a) 2.4-3.2 5.2-6.0 | 7.8-8.5 8.9 
Sinusoidal delay 
Bm = 05, m= 2.0, 6=0 
6-db slope atten. of Fig. 9(a) 2.6-3.2 4.6-6.2 | 7.5-8.1 9.9 
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f(t) env (t) = g(t) (4) 
N 

g(t) = 2d, A (ne) COS [Na t + W(nwo)]. (5) 


A(nao) and ¥(nwo) are the amplitude and phase of the spectrum of the 
pulse across the transmission band of interest. The effect of passing 
such a pulse through a transmission medium is to yield an output 
pulse which can be represented by: 


N 
h(t) = 2 A(nao) R(neo) cos [nwo t + (nw) + vlna]. (6) 


Herc, R(nay) and ¢g(nw) represent the attenuation and phase of the 
transmission medium. The envelope delay D(w) is the derivative of 
g(w) with respect to w. 
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Fig. 9 — Attenuation and delay shapes. 
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The parametric forms of R(w) and D(w) are given in chart form in 
Fig. 9. Aside from a flat passband, the primary form of attenuation 
considered was one which seems to be typical of a large part of the 
voice-band plant. It consists of a flat passband out to some cutoff 
frequency, followed by a slope attenuation of so many db per cycle. We 
note that this is not db per octave; that is, the curve is linear with 
respect to frequency and not with respect to the log of frequency. In 
addition, some attenuations previously introduced, whose spectra are 
shown in Fig. 6, were also considered. Among these is the effect of 
varying the built-in overlap or intersymbol interference, which is, of 
course, also a variation in the amplitude spectrum of the pulses. 

The general forms of delay considered were sinusoidal, parabolic, 
quartic and a parabolic bounded sinusoid. In addition, many of these 
forms were also used with variations in the band edges of the delay. 
For example, sinusoidal delay might be used across 70 per cent of the 
band and then a relatively sharp cutoff delay used at the band edge. 
Finally the simulation is set up to handle frequency by frequency read- 
in of both phase and attenuation across the transmission band. 


VI. DELAY RESULTS 


6.1 Introduction 


The remainder of this article is a presentation of the results achieved 
using the simulation to investigate performance of a four-phase system 
over a telephone channel. While some raw data results on impulse noise 
are presented in Table I, for the reasons described above the remainder 
of the discussion will refer to the eye aperture results shown in curve 
form in a group of figures. 


6.2 Multicycle Sinusoidal Delay — Group Band Transmission Design 


We begin our discussion with the results obtained in an application 
which is at once the simplest to explain, has perhaps the clearest in- 
tuitive explanation of the effects of delay, and yet is of definite practical 
importance. This is the case illustrated in Fig. 9(b), in which the delay 
is a sinusoidal function of frequency across the transmission band of 
interest. 

Such a sinusoidal delay is defined in terms of three parameters. 
First is the amplitude of the delay (Bm of the figure), which charac- 
terizes the peak delay in bit times. Second is the phase (@), which 
represents the position of the sinusoidal delay relative to the carrier 
frequency of the system. Finally, the number of cycles of the sinusoid 
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across the transmission band of interest is given by m. This characteriza- 
tion is of particular interest in group bands. Typical delay shapes which 
arise due to group band separation filters might be those shown in 
Vig. 10(a). In order to permit transmission of data over group band 
channels these delay curves are equalized. The results of equalization 
typically yield a number of cycles of sinusoidal delay, such as shown in 
Tig. 10(b). The designer often has some control both over the magni- 
tude and the number of ripples of delay, with a compromise to be made 
between technical factors and economics. That is, the greater the 
number of ripples required, in general, the more stages of equalization 
are needed, and thus there is an economic constraint involved. 

The basic results on multicycle sinusoidal delay are shown in Figs. 
11 through 13, where Fig. 13 is an overlay of the preceding two figures. 
A number of conclusions can be drawn from these results. For design 


5 


(a) 


DELAY IN BITS 
a 


(b) 


{e) 
WcaR 0.35 HB)7 CAR Mcart 0.35 Hair 


Fig. 10 — Group bands: (a) unequalized, (b) equalized. 
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Fig. 11 — Aperture vs delay for sinusoidal delay; all curves use @ = 0, 


B —~ We 
p= oos| (* “m+ 0]. 
Whit Mbit 


Aperture = 0 indicates system makes errors even with no noise. Peak-to-peak 
delay = 2Bm. Curves for Bm = 0.5, 1.0, and 1.5 bits peak delay. 





purposes, two of these are most significant. First is the general improve- 
ment in transmission as the number of cycles of delay is increased while 
holding the peak delay in dibits constant. Second is the relative pref- 
erence for an odd number of cycles of delay across the passband. This 
is clearest if 9 = 0 (i.e., the aperture for m = 3, 6 = 0 is larger than 
for m = 4, 6 = 0). If 6 = x/2 there is not much difference between 
m = 3 and m = 4. However, since the larger m will generally be more 
expensive to achieve by equalizing, m = 3 would again be preferred. 
We consider first the improvement in transmission for constant peak 
delay. To a first-order approximation this can be explained on the basis 
of echo theory.” Since the delay is sinusoidal, the interference intro- 
duced is due primarily to echoes. In particular, simulation shows the 
first echo on each side of the pulse transmitted to be the primary cause 
of interference. The amplitude of any echo is proportional only to the 
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Fig. 12 — Aperture vs delay for sinusoidal delay; all curves use 6 = 7/2, 


B —~ We 
p= 2 cos (* =) m-+o]. 
Wbit Wbit 


Aperture = 0 indicates system makes errors even with no noise. Peak-to-peak 
delay = 2Bm. Curves for Bm = 0.5, 1.0, and 1.5 bits peak delay. 





amplitude of the sinusoidal phase (B), and is independent of the other 
parameters of the delay. Thus as the number of cycles of sinusoidal 
delay increases, holding the product Bm constant, B decreases and so 
does the interference. Fig. 14 shows a typical four-phase system pulse 
and the echoes produced by the sinusoidal delay for various numbers 
of cycles (in frequency) with constant peak delay. The advantages of 
the simulation being able to show individually distorted pulses in 
such clear detail is most evident at this point. 

On the other hand, the position of the echo is determined solely by 
the number of cycles of sinusoidal delay across the band; that is, the 
distance of the echo from the main pulse is directly proportional to m. 
The preference for odd numbers of cycles of delay can be explained, to 
a first-order approximation, as follows. The primary interference is the 
first echo in each direction. As the number of cycles of delay increases, 
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the effect on this echo is to move it farther and farther from the main 
pulse. We note that the amplitude of the echo is decreasing for a con- 
stant value of peak delay. In practice, this means that as the number of 
cycles of delay is increased, the interference progressively phases in and 
out with respect to the time at which the demodulated pulses are 
sampled. In other words, while the echo is always present, its major 
point of interference oscillates between the sampling instant and midway 
between the sampling instants of the demodulated pulse train. This 
oscillating phasing in and out with respect to the sampling time of the 
interference from the echo produces alternately more and less inter- 
ference at the sampling instants. 

An example is shown in Fig. 14. The peak of the first echo comes 
midway in the adjacent time slots of the two-cycle interference. This 
produces a maximum distortion in phase in the adjacent time slot — 
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Fig. 13 — Combination of Figs. 11 and 12: 
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aperture = 0 indicates system makes errors even with no noise; peak-to-peak 
delay = 2Bm. 
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DIBITN-2 DIBIT N-1 DIBIT nN | DIBIT N+1 DOIBIT N+t2 


Fig. 14 — Effects of sinusoidal delay in terms of echoes: (a) basic four-phase 
pulse, (b) echoes from two-cycle/band sinusoid, and (c) echoes from three-cycle/ 
band sinusoid. 


that is, at the sampling instants. For the three-cycle interference, the 
echo reaches a peak midway between time slots. It therefore produces 
less distortion in either of the two time slots it interferes with, and a 
correspondingly lower maximum distortion. 


6.3 Delay Results Useful for Voice-Band Design 


This section introduces and discusses results over a variety of delay 
characteristics. The curves representing these characteristics share the 
property that they have no more than 3 minima. Their general shapes 
are such that they can be made to represent a wide variety of voice- 
band channels. 

The first set of curves, shown in Fig. 15, is simply an enlargement 
of the effects of sinusoidal delay for less than two cycles of sinusoid. 
Fig. 16 shows some typical voice-band delay curves. For each an ap- 
proximating sinusoid (with its associated parameters) is given. The 
results in Fig. 15 are for 6 from zero to 7/2 radians. However, since the 
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Fig. 15 — Aperture vs delay for sinusoidal delay — expansion for low m of 


Fig. 13. 
Bm W — WW 
Delay = — cos ( ) mee]. 
Mb it Mbit 


Aperture = 0 indicates system makes errors even with no noise present; peak-to- 
peak delay = 2'Bm. 





results repeat in successive quadrants (i.e., 9 = m gives essentially the 
same results as 6 = 0) the curves represent the full range of 9. 

Figs. 17 and 18 give results for quadratic and fourth-power delay 
distortion. The curves represent delay symmetric with respect to the 
carrier, and displaced from the carrier by varying percentages of the 
bandwidth of interest. It is worthwhile to note again that these curves 
are, as are all the curves given in the results, normalized with respect to 
the carrier frequency and the bit speed. Thus one can apply the results 
to any frequency range and corresponding bit speed. The curves are 
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normalized for a 13 carrier cycle per dibit system. Thus, for example, a 
typical system at this speed would have an 1800 cycle carrier and 
transmit 2400 bits per second, i.e., 1200 dibits per second. Again, the 
same curves apply to a 48,000-bps system using a 36-ke carrier. 

The results presented to this point represent relatively simple delay 
shapes—that is, sinusoidal, quadratic, or quartic delays. It will be shown 
in Section 6.5 that the same results apply within very reasonable limits 
to any delay curve which is identical with one of the delays considered 
over & minimum of 70 per cent of the band. In addition one class of 
relatively complicated delay shapes was also investigated. These delays 
are parabolically bounded sinusoids, and are described in Fig. 9(e). As 
the product of a parabola plus a constant with a sinusoid they can be 
used to represent a wide class of various delay shapes. The results are 
given in Figs. 19 and 20 for m = 2 and m= 8 and for a range of values 
of the ratio a,, to K. It will be seen that these ratios are particularly 
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Fig. 16 — Representations of some typical lines by sinusoidal approximations 
(primes are approximations). K = 2 links K carrier, ON = 3 links ON carrier, 
and H = 300 miles of H44 loaded cable. 
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Fig. 17 — Apertures for parabolic delay; aperture = 0 indicates system makes 
errors with no noise. 
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Fig. 18 — Aperture vs delay for fourth-power delays. 
954 


955 


FOUR-PHASE DATA TRANSMISSION 















































ayn Lysadv 





TOTAL Aps+K:L.e. TOTAL DELAY POSSIBLE AT Wcag + pir (0.35)) 





DELAY (BITS 


Fig. 19 — Aperture vs delay for parabolic bounded sinusoidal delays, 6= 0. 
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Fig. 20 — Aperture vs delay for parabolic bounded sinusoidal delays, 6 = 90°. 
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uscful in the setting of curves of maximum degradation for per- 
formance in the voice band. 


6.4 Line in a Given Class Producing Most Degradation 


The simplest generally used way of characterizing delay is to specify 
the maximum allowable delay across some percentage of the trans- 
mission band. For example, one might say that the maximum delay 
across 70 per cent of the transmission band is to be no more than 4 
dibit (making a tacit assumption that the minimum delay is 0). It is of 
interest to specify the worst possible delay shape meeting such a re- 
quirement — that is, to find the delay shape falling within this allow- 
able maximum which produces the worst degradation in performance. 
For a line of reasonably good performance, say for eye openings greater 
than 0.6, an answer to this problem has been provided by R. W. Lucky.’ 
However, since the lines he allows include such pathologies as discon- 
tinuities in the delay, the limiting cases must often for our purposes be 
considered nonrealizable. It is therefore of interest to find the worst line 
out of the classes we have considered, using the above criterion of 
“worst.’? For a check this worst line then can be compared to that 
obtained by Lucky. 

If the limiting value of delay is defined across 60 per cent of the band 
or greater, the worst performance in the classes discussed above is that 
obtained with a slightly less than two-cycle sinusoid across the trans- 
mission band. Fig. 21 shows in graphic form the difference in perform- 
ance between such a two-cycle sinusoid and various other delay charac- 
teristics having the same maximum delay across a given percentage of 
the band. For the range of reasonably good transmission lines which 
Lucky has considered, Lucky’s ‘‘worst” line produces only about one 
db greater degradation in performance as measured by the respective 
aperture values (or equivalently by the impulse noise performance; see 
Section 5.2) than this two-cycle sinusoid. When one remembers that 
Lucky’s lines are in general quite drastic in their shapes, it seems rea- 
sonable to use this two-cycle sinusoid as an upper bound through the 
remainder of this paper. 


6.5 Effect of Delay at the Edges of the Band 


Examination of the spectrum of an individual pulse shows that the 
energy falls off very rapidly toward the edge of the band. Now when 
timing is performed using the line signal, the energy near the sides of 
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Fig. 21— Examples of performance range meeting classical delay require- 
ments; frequency in multiples of bit speed from carrier. 


the band can (and for certain nonrandom patterns does)* contain a 
great deal of the necessary information. On the other hand, for timing 
performed from the data signal, in general for timing from random data, 
and for recovery of the data itself, the edges of the band contain only a 
relatively small amount of the information. Thus it would not appear 
necessary to specify the delay as accurately across the entire trans- 
mission band of interest. 

A number of delay curves were used to check this supposition. Fig. 22 
shows some of the cases which gave the widest variation in performance. 
These lines have respectively quadratic (curves a, b, c) and fourth- 
power (curves d, e, f) delay as a function of frequency for the frequency 
FANE Wear — 0.35 writ tO Wear + 0.85 wit. However, the eye apertures 

* Timing for random patterns even when using the line signal gets most of its 


energy from the center of the band where the spectrum has much greater ampli- 
tude. This fact was pointed out to the author by M. A. Logan.’ 
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Fig. 22 — Bandwidth requirements, all curves symmetric about wear: (a) 
quadratic delay,(b) fourth-power delay. 


for even the extreme cases shown vary by less than 2 db, and in most 
cases that seem to be of practical interest a variation of less than one 
db was found. The same results held for the impulse noise performance 
discussed in Section 5.2. 

Similar results were found for delays in which the center band delay 
was a sinusoidal or parabolically bounded sinusoidal function of fre- 
quency. Therefore, one concludes that if delay is specified for the fre- 
quency range from oar — 0.35 writ tO wer + 0.35 opi, then for the 
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recovery of the data signal it is not worth the cost or effort to try to 
equalize delay beyond about 70 to 75 per cent of the transmission band. 


6.6 Voice-Band Transmission Design 


Historically, the specification of delay has been given in a staircase 
arrangement. This is equivalent to setting a sequence of pairs of check 
frequencies and the limitation on delay between them. Thus, for ex- 
ample, a typical delay specification might be 


Wear 0.3 Wbit Wear + 0.3 Wpbit 1.5 dibits delay 


Wear — 0.4 Wbit Wear +- 0.4 Obit 2.5 dibits delay (7) 
Wear — 0.5 Wbit Wear + 0.5 Wbit 4.0 dibits delay. 


When delay is specified in this manner, no account is taken of the wide 
variation in performance of delays of various shapes all meeting the 
basic requirements. Thus, for example, the delays shown in Fig. 21 all 
meet the requirements listed above, yet have eye apertures ranging from 
0.45 to 0.85. In effect, one is faced with the choice of either placing too 
strict a requirement for many delay shapes, or not in truth being able 
to guarantee that delays meeting a particular requirement will have no 
more than a certain allowable degradation. 

These results suggest a somewhat more complicated specification of 
delay requirements for good transmission design. One way of doing this 
is to use a single equation and vary the parameters of this equation to 
allow for various delay shapes — for example, to bound the delay by 
ap, wo + K. By choosing various ratios of a», to K one can allow for a 
wide variety of delay shapes. 

In the voice band there are three major shapes of delay which must 
be considered: namely, the usual single-minimum parabolic-type delay 
shape which arises from carrier transmission, the slope-type delay 
which arises from loaded cable, and the more rectangular delay with a 
ripple across the transmission band which results from equalizing. The 
three curves of Fig. 21 are examples of these types of curves. To repre- 
sent these classes, consider three values of the ratios of a,, to K. For the 
equalizer ripple-type line a ratio of ap; to K of 1 to 1 was chosen. For a 
carrier transmission delay a ratio of ap; to K of 4 to 1 was taken. For the 
loaded cable-type delays a ratio of 8 to 1 was chosen. For each of these 
delay shapes a set of design curves for various allowed degradations was 
derived. In each case the maximum delay was found such that a specified 
degradation in performance would not be exceeded. As usual, the 
allowable degradation is in aperture or equivalently in impulse noise 
performance. These curves are shown in Figs. 23, 24 and 25. 
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Fig. 23 — Design curve no. I. Maximum allowable delay for ap,./K = 1 to give 
indicated degradation: 


D = [aps/(w — Wear)? + K] cos (nw + 8). 


Curves give maximum degradation; in general, degradation should be less for 
delays falling within bounds shown. Note that flat delay introduces no distortion. 


-0.3 0, -0. 


It is not necessary for the delay of a particular line to meet all the 
requirements of all three sets of curves in order to have a particular 
allowable degradation. On the contrary, if a given delay falls within any 
one curve for a particular allowable degradation in performance then it 
will meet this degradation requirement. Thus, in the field or in design, 
it is necessary to consider three curves of allowable delay to determine 
if a particular delay shape will meet a particular degradation require- 
ment. 

In the curves, delay has been specified only up to 70 per cent of the 
band. This is in keeping with the previous results on the non-necessity 
of specifying delay beyond this point. However, it is true that there is 
some range in the actual performance due to the effect of delay beyond 
the 70 per cent limit. In addition, as discussed in the section on worst 
lines, these delays are not the actual worst lines as found by Lucky. 
To allow for these two factors, an additional 1-db margin is built into 
each of the curves. It is felt that only in the very rarest of circumstances 
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give fadscated degradation: 
D = [aps/(w — Wear)? + K] cos (nw + 6). 


Curves give maximum degradation; in general, degradation should be less for 
delays falling within bounds shown. Note that flat delay introduces no distortion. 


will this margin be insufficient, and then only by a very small additional 
amount of degradation. 


VII. ATTENUATION RESULTS 


The scope of this study did not include a systematic investigation of 
the distortion due to attenuation alone or to a combination of attenua- 
tion and delay. However, certain representative results are discussed 
here to indicate some of the effects of attenuation. The simulation can, 
of course, handle any desired attenuation. 

The spectrum has symmetric components with respect to the carrier. 
Thus for symmetric delay (in particular for no delay distortion), slope 
attenuation across the transmission band [i.e., attenuation = k, (fre- 
quency) + k»] should produce an effect equivalent to a flat loss of value 
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Fig. 25 — Design curve no. III. Maximum allowable delay for ap;/K = 8 
to give indicated degradation: 


D = [ays/(w — wear)? + K] cos (nw + 6). 


Curves give maximum degradation; in general, degradation should be less for 
delays falling within bounds shown. Note that flat delay introduces no distortion. 
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equal to the attenuation at the carrier frequency. This line of thought 
was essentially verified using the simulation. In the presence of delay 
unsymmetrical with respect to the carrier frequency, however, this is 
no longer true. This also indicates the nonadditive nature of the effects 
of attenuation and delay. In any study of the combined effects of at- 
tenuation and delay distortion, it is this nonadditive nature of the 
interaction which is the most important single fact. 
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The general form of attenuation considered was linear on a plot of db 
versus frequency as shown in Fig. 9(a). This attenuation shape was 
found to be typical of voice-bands by Alexander, Gryb, and Nast.® The 
cutoff frequency was taken as the carrier minus 20 per cent of the bit 
speed. For example, with an 1800-cps carrier operating at 2400 bps, this 
would be at 1800 — (0.2) (2400) = 1320 eps. The slope was measured 
from this cutoff frequency (wear — 0.2 wpit) tO Wear + 0.385 wit. Slopes 
ranging from 4 to 12 db over this range were investigated. 

Table II shows clearly the nonadditive interaction of the attenuation 
plus delay. Both delays shown in Table II are sinusoidal and respectively 
of odd and even symmetry with respect to the carrier frequency. The 
results are normalized to zero attenuation at the carrier. 


TABLE IIT—INTERACTION OF ATTENUATION AND DELAY 


























Apert 
Amplitude Delay Attenuation pep on y Delay 
enuation 
Bm 0 m 6-db slope 0.45 0.36 
1.0 0 2.5 
Bm 6 m 6-db slope 0.70 0.32 
0.5 w/2 2.0 
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NEASIM: A General-Purpose Computer 


Simulation Program for Load-Loss 
Analysis of Multistage Central 
Office Switching Networks 


By R. F. GRANTGES and N. R. SINOWITZ 


(Manuscript received November 7, 1963) 


Blocking probability is the most frequently required performance charac- 
teristic in traffic studies of complex central office switching networks. Deter- 
mining this quantity without actual measurement ts a difficult task. To aid 
the communications system designer, a simulation program has been prepared 
which produces useful estimates of blocking probability for a large class of 
networks. The program is based on a simplified mathematical model for the 
analysis of switching networks developed by C. Y. Lee, and thus differs 
from conventional simulators in that it simulates a mathematical model 
rather than a traffic-handling system. 

Although Lee’s model 1s widely used, tts utility has been limited by com- 
putational difficulties encountered in networks of realistic size and com- 
plexity. This limitation is in most practical cases removed by the program, 
which features rapid input preparation, short computer runs, and spectfica- 
tion of the desired precision of the results as an input parameter. Moreover, 
the program allows for the incorporation of more a priori information 
about the actual behaviour of switching networks than ts included in Lee’s 
model, thereby leading to a more accurate estimate of blocking probability. 

The simulator has been programmed for the IBM 7090 computer, but 
the concepts are machine independent. 
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I, INTRODUCTION 


Determining the traffic performance of complex multistage central 
office switching systems without actual measurement can be a major 
problem for the communications engineer. While probability theory has 
been successfully applied to a wide variety of telephone traffic prob- 
lems,!?3 a precise formulation of a mathematical model completely 
describing the multistage switching system has thus far not been found. 

No systematic approach exists which completely accounts for the 
gross complexity encountered in large-scale congestion systems, but 
several authors have contributed significantly to the theory, notably 
C. Jacobaeus,**® K. Lundkvist,> A. Jensen,’ C. Y. Lee,® A. Elldin,® R. 
Fortet,!° and P. LeGall." More recently, V. E. Benes'?-!5 has initiated 
“an attempt to describe a comprehensive point of view towards the 
subject of connecting systems.” Although the engineer does not yet 
have a comprehensive theory, he does have a valuable tool in computer 
simulation. 

Simulation of telephone traffic flow has a long history in the Bell 
System. As early as 1907, a rudimentary simulation was undertaken to 
improve switchboard performance. Artificial traffic was generated by a 
card-drawing technique, and the simulation was used to verify a semi- 
mathematical analysis of the loads which could be handled by a team 
of operators meeting an average delay criterion. In the ensuing years, 
simulation techniques have been aids in the study of complex traffic 
problems, such as the effect of limited sources on graded multiple capac- 
ities, the efficiency of random slipped multiples, the capacities of various 
alternate routing plans, and the distribution of delays under various 
trunking plans. The traffic load capacity of the No. 1 crossbar network 
was largely determined by the load-loss relationships in the link and 
junctor patterns obtained from elaborate simulations begun in 1936. 
This was the first time that the capacity of a largely complete system 
had come under study by simulation methods. A 10,000-line No. 5 
crossbar office was simulated in 1948 by a specially designed machine 
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which coordinated the efforts of four operators, providing significant 
data for the traffic engineering of this system.'° 

In recent years the high-speed electronic digital computer has proved 
an effective tool in large-scale traffic simulations.”-” For this class of 
simulations, special computer programs are written which usually con- 
tain: 

(i) a logical description of the system under consideration, 
(iz) a procedure for generating and offering traffic to the system, and 

(i277) a method for extracting and recording the desired system charac- 
teristics. 

These programs may be called “special-purpose” simulators — in the 
sense that they are written for the purpose of studying a specific traffic- 
handling system. A variety of performance data may be obtained, in- 
cluding: 

(2) probability of blocking at various loads (load-loss data) ; 

(27) delay distribution, including average delay on calls delayed; and 

(227) mean queue lengths. 

Of these, the most frequently required datum is the probability of loss 
(or delay). 

These simulation programs have produced a large amount of useful 
information, but their application has not been widespread because of 
the considerable programming effort required. To reduce programming 
effort, various ‘“‘general-purpose”’ traffic simulation programs have been 
written.*4:25.26 However, it must be understood that each program is 
only ‘general’ with respect to a particular class of traffic systems. 

The multistage central office switching system is an example of a class 
of traffic-handling systems for which no general-purpose simulator has 
heretofore been written, although much has been accomplished by 
special-purpose simulations written for specific switching network 
arrangements.” Because the use of these network simulation programs 
has been greatly restricted by the cumbersome programming and input 
preparation required, a strong need has developed for a quick, easy-to- 
use, general-purpose simulation technique. 

To meet this need, the authors have developed a computer program 
which, with a minimum of user effort, will produce useful estimates of 
blocking probability for a very large class of multistage switching net- 
works. The program is based on a simplified mathematical model of 
switching networks developed by C. Y. Lee,’ and differs in approach 
from programs referred to earlier in that a complete description of the 
traffic-handling system itself is not given to the computer; rather, the 
program simulates the behavior of Lee’s analytical model. Deriving its 
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name from this view of its operation, the program has been acrony- 
mously named NEASIM — NEtwork Analytical SIMulator. 

While Lee’s model is probably the most widely used analytical model 
for multistage matching networks, its utility has been severely limited 
by the computational difficulties associated with networks of realistic 
size and complexity. This limitation is in most practical cases removed 
by the program, which features rapid input preparation and short (hence 
economical) computer runs. I'urthermore, unlike many simulations in 
which the reliability of results must be assessed on an a posteriori basis, 
the analytical simulator admits of an a priori appraisal so that desired 
precision becomes an input parameter. 

The probability linear-graph model, basic to the NEASIM approach, 
is described in Section II, where its use in switching network analysis 
is explained. The philosophy of the program together with a general 
description is presented in Section III. Two key program routines are 
described in Section IV. Reliability considerations are given in Section 
V. Finally, Section VI discusses program modifications which can in- 
crease the validity of the NEASIM estimate. 


Il. THE PROBABILITY LINEAR-GRAPH MODEL 


In 1955 C. Y. Lee,’ extending the earlier work of Kittredge and 
Molina, presented a simplified mathematical model for the analysis of 
switching networks. Since the NEASIM program simulates the be- 
havior of this model, a description of the model is given (Section 2.1). 
An example to illustrate how the model is applied is given in Section 
2.2; the computational difficulties which may be encountered in the 
analysis of practical networks are then discussed — thus pointing up the 
need for the simulation program. Section 2.3 explores further the notion 
of blocking probability for a network and the use of the linear-graph 
model in determining this quantity. 


2.1 The Model 


Consider a crosspoint network in which each input can be connected 
to any output by the operation of appropriate crosspoints. Let P:(j,k) 
be the probability that all paths through the network between input 7 
and output k are busy at time ¢. 

Associate with each link 1; of the network a binary-valued random 
variable X;‘” whose value represents the state of the link at time ¢. 
The NEASIM convention is: 0 represents idle and 1 represents busy. 
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Since the concern of telephone traffic engineers is with ‘‘busy-hour” 
traffic, it is usually assumed that 

(a) the busy-idle distributions of the link random variables are sta- 
tionary (or homogeneous) in time. 
That is, for any JN, 


Pr{X,,° = d.3 7 = 1,-°°-,N} = Pr{Xian” = 6,37 = 1,-°-,N} 


for all h, where 6, = 0 or 1 and the index 7 runs over all the links in the 
network. A consequence of this assumption is that P;(7,k) is also sta- 
tionary in time, for all 7 and k, and the subscript ¢ will henceforth be 
omitted. 

Let us now fix our attention on two terminals, one on each side of 
a crosspoint network in which 

(b) all of the switches are nonblocking,* 
and in which 

(c) there ts no connection path between any switch and itself. 
Then the configuration of possible paths through the network between 
the two terminals can be represented by a two-terminal cycle-free linear 
graph with directed branches, in which the nodes of the graph represent 
network switches and the directed branches of the graph represent 
network links. Consider, for example, the network depicted in Fig. 1. 
The possible path configuration seen between terminals A and B is 
indicated by heavy lines, and the corresponding graph is as shown in 
Fig. 2(a). 

Next, assume that 

(d) P(j,k) ts independent of j and k, 
and we can speak of P(j,k) = B as the probability of blocking of the 
network. The notion of blocking probability will be further explored in 
Section 2.3. 

Finally, Lee makes the simplifying assumption: 

(e) the link random variables, X“, are independent. 
This assumption, which is frequently made to render analysis managea- 
ble, is the principal weakness of the model and will cause the results to 
depart from reality in varying degrees — depending on the particular 
network. In general the model will tend to overestimate blocking. The 
problem of obtaining realistic results is discussed further in Section VI. 

Each of a large class of switching networks can therefore be repre- 


* Lee’s requirement that the switches be nonblocking is actually not restric- 
tive, and can be relaxed by an appropriate adjustment of the link occupancies. 
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Fig. 1 — Simple crosspoint network with possible paths between terminals A 


and B indicated by heavy lines. 
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OCCUPANCY = p p 





Fig. 2— (a) GRAPH of the network of Fig. 1, with occupancy p on each of 


the branches. (b) The subgraphs of the GRAPH. 
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sented by a simplified model called, by Lee, a two-terminal probability 
linear-graph in which 

(t) switches are represented by nodes, and links by directed branches, 
and 

(72) assumptions (a)—(e) hold. 
The mathematical object, the two-terminal probability linear-graph, 
will be referred to as GRAPH in the sequel. 

Once the GRAPH of a network is obtained, the calculation of the 
blocking probability can proceed in a straightforward manner. 


2.2 An Example 


As a first illustration, consider the GRAPH of Fig. 2(a). Let E' be the 
event that there is a path through the GRAPH,* EH; be the event that 
branch 06; is idle, and p; = Pr{b; is busy}. Then 


E =A,UA,UA;,U 4A, 
where the paths, A;, are 
Ay = ky a} EB; N Ey 


A,= EN EN Bs 

A,=E,N EN FE; 

Ay= EN E.N Eg. 
The blocking probability is then 


B=1-—Pr{B} 
1-— Pr{A, U A, U A; U A} 


4 


4 
1 - Dy Pr{ A; aia 


Pr{A; a) A; 
Bers 
4 


— >) pr{4;N A; N Aj} + Pr{Ai.N 4294; Ad. 


Now the assumption of independence gives 
Pr{H, --- Ej} = Pr{H#} --- Pr{E} 
= Girt: Qj qQ=1-—p,; 
* Having adopted the GRAPH model, we speak of a ‘“‘path through the 


GRAPH”? rather than ‘‘a path through the network,” and say that ‘‘a branch is 
busy or idle’”’ rather than ‘‘a link is busy or idle.”’ 
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whence, for the case in which p; = 7p for all 7, 
B= qi — 4q' + 29° + 4q° — 4¢° +1. (1) 


In general, given a GRAPH G with m different link occupancies 
Pi, °** 5 Pm, the procedure just illustrated will yield the blocking 
polynomial of the GRAPH: 


Bg = Be(pi; or Dn). 


As a computational tool the utility of the GRAPH model decreases 
with increasing complexity of the GRAPH’s geometrical structure. 
When the GRAPH geometry grows more complex, the blocking poly- 
nomial Bg becomes cumbersome — admitting a greater possibility 
for error in its determination. Moreover, once Bg is found, one still has 
to substitute numerical values for pi, °-+, Mm into the polynomial 
to obtain a result. As an example, for the GRAPH of moderate com- 
plexity shown in Fig. 3, the blocking polynomial is 
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Fig. 3 — A GRAPH of moderate complexity. 
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b= p 

+ p'q(8A") 

+ p°q(4A* + 244°B*) 

++ p’q’(24A*B* + 32A°B*C) 

+ p'q'(8A*C* + 484°B’°C? + 8B°D + 6B') (2) 

+ p'¢(32AB°C’D + 24B'C") 

+ p'q°(4C0° + 24B°C*D’) 

+ pq'(8C*D") 


+ @(D*) 
where: 
A =p + Qpr 
B=p-+ ap: 
C = pt gp: 
D=p+ qn 
GS 1 = Di 


Faced with computing B in (2) over a range of occupancies, an engineer 
would surely resort to a computer. The essential computational diffi- 
culty is that one is confronted with expressions of the form 


Pr{ A; U eves U A} 


where the paths A;, --- , A; are not disjoint. 
It is interesting to note that the method of approach just described — 
which may be called the ‘path enumeration approach” — is not the 


only way to proceed and is indeed not the most efficient. A second pro- 
cedure for finding the blocking polynomial may be called the “combi- 
natorial approach” and is best illustrated by example. 

Since all the branches in the GRAPH of Fig. 2(a) are busy with proba- 
bility p, a moment’s reflection shows that the blocking probability can 
be written as 


B= p + GD Beaiennk 1 + D0 Dawheraph 2 + @ - Bsuberaph 3 
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where Bsubgraph 1, Bsubgraph 2 Bsubgraph 3 are, respectively, the blocking 
probabilities of the GRAPHs indicated in Fig. 2(b). But by inspection 
we have 


Buubgraph 1 = (L — 9°) 
Baubgraph2 = (1 — q°)’ 
Beubgrsph 3 = [1 — (1 — p°)ay, 
so that 
B= p+ 21-7) +7 - 1 pv’) 


is the blocking polynomial (1) expressed in another form. 
The GRAPH of Fig. 4 is more representative of the type of GRAPH 
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Fig. 4— Typical GRAPH geometry of realistic central office network. 
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encountered in modern central office networks. Determination of the 
blocking polynomial in this case — by either the path enumeration or 
combinatorial approaches — is a formidable task indeed. If the GRAPH 
model is to be useful to the modern engineer, some means of handling 
such complex GRAPHs must be made available. 

When seeking performance measures of a network, the engineer 
is not really concerned with the explicit polynomial representation 
Be(pi,°+* , Pm); what he would like is a curve (or set of curves) dis- 
playing this functional relationship. The simulation program described 
in the following sections, when given the link occupancies p1, --- , Dm, 
produces, with predictable precision, a numerical estimate of Bez . 


2.3 Blocking Probability and the Linear-Graph Model 


When several kinds of traffic are handled with different disciplines 
in a network which may have several characteristic graphs, blocking 
probability for the network can only be meaningfully defined relative 
to the persons or terminals encountering blocking. For example, the 
blocking encountered by a call originating and terminating in the same 
central office may not be the same as the blocking encountered by an 
incoming call. In general, a network is required to handle several 
“classes” of connections and the engineer is concerned with the blocking 
probability for each of these classes. We shall limit our discussion of 
blocking probability to one class, that is, a subset of all input-output 
pairs in which each input-output pair has the same graph and for which 
it is reasonable to suppose that the traffic between every input-output 
pair is identical* with that of every other pair. Without loss of generality 
we can therefore assume that the network has one class, so that when 
speaking of the blocking probability of the network, we shall mean the 
blocking probability of the class. (These remarks form the basis of 
assumption (d) of Section 2.1.) 

Having thus limited ourselves to one class of connections, we have still 
to define the blocking probability of a network in a manner which is 
in agreement with the generally familiar definitions. Here we again en- 
counter difficulty. The authors agree wholeheartedly with Benes (Ref. 
15, p. 2805), that ‘In fact, not even the definition (let alone the calcu- 
lation) of the probability of blocking has received adequate treatment 

Syski (Ref. 3, p. 198), after a long series of prefatory remarks, defines 
two quantities, time congestion S(é) and call congestion z(t) for the case 


* That is, every input calls every output at the same rate with the same hold- 
ing time distribution and with the same lost calls disposition. 
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of a simple full-access trunk group. The time congestion is the probabil- 
ity that all trunks in the group are busy at time ¢; the call congestion 
is the conditional probability that the group is blocked when a call 
arrives at the instant ¢. Under the input assumptions and for equilibrium 
conditions, time and call congestions are independent of time and are 
denoted by S and 7, respectively. S is equivalent to the fraction of time 
during which congestion is encountered, while 7 is equivalent to the 
fraction of calls encountering congestion, and the two quantities will, in 
general, differ. 

Of these, of course, the measure of most concern to the network de- 
signer is call congestion. Now if it is assumed that calls originate com- 
pletely independent of the state of the network, time congestion will 
equal call congestion. Such an assumption is unjustified if calls cannot 
originate from busy lines, since time congestion conventionally includes 
busy line periods while call congestion excludes them. It is, however, 
reasonable to assume that idle pairs of terminals originate calls at a 
constant rate independent of the state of the network. In particular, 
there must be no change in the calling rate after a blocked call. If, under 
this assumption, time congestion is modified to include only periods in 
which both lines are idle, it will be equal to call congestion. Actually, 
even if the foregoing assumption is not met, the time between calls is 

likely to be much longer than the time taken by the network to return 
to equilibrium, so that, again, the modified time congestion will be close 
to the call congestion. 

With a suitable choice of branch occupancies, Lee’s model allows the 
computation of call congestion. Alternatively, the branch occupancies 
may be chosen so as not to reflect the requirement that only idle ter- 
minals are to be considered, thus allowing the computation of time 
congestion. In either case the computed results will be subject to the 
error introduced by inaccurate assumptions in the model. 

Before viewing the probability linear-graph model in the light of the 
above remarks, it is well to make an observation on the underlying 
philosophy of the model. Let us perform the following conceptual experi- 
ment in a real network under a particular set of (equilibrium) traffic 
conditions. Suppose that we fix our attention on a particular representa- 
tive input-output pair (7,4) and examine closely that portion of the 
network seen between them, Le., their graph. It is reasonable to believe 
that, were we to examine the detailed traffic pattern within the graph 
for a sufficiently long time, we would ultimately come to have complete 
knowledge of the busy-idle state behavior of the graph links under the 
particular traffic conditions. The experiment could be repeated under 
other equilibrium traffic conditions, so that we would eventually be able 
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to describe completely the behavior of the graph links under all equilib- 
rium conditions. Assuming that the particular input-output pair and 
connection graph studied are truly representative of the entire network 
(or at least of an entire connection class), we could then ‘‘discard” the 
rest of the network and determine the blocking probability of the net- 
work under various equilibrium conditions by computations or simula- 
tions based only on the connection graph and our complete knowledge 
of its behavior. 

That such complete knowledge could be obtained is a practical im- 
possibility. In the absence of complete knowledge, assumptions can be 
and, indeed, must be made about the detailed behavior of the graph 
based on such a priort knowledge as we have. It is reasonable to suppose 
that, as the assumptions made approach the real behavior of the graph, 
the blocking probability determined from the graph will approach the 
real blocking of the network. A belief in the fundamental soundness of 
this reasoning constitutes the basic philosophy of the graph model 
approach to the determination of blocking probability, beginning with 
Molina and continuing with C. Y. Lee and the NEASIM program. 

The assumption made by Lee of link independence [(e) of Section 
2.1], while obviously omitting much a priori knowledge of graph be- 
havior, possesses the practical advantage of allowing computation of the 
blocking polynomial where graph geometry does not prohibit. The basic 
NEASIM program allows evaluation of the polynomial for all graphs 
meeting Lee’s restrictions. 

The utility of the results obtainable from Lee’s model is well known. 
When the specific branch occupancies are chosen rationally,* the calcu- 
lated blocking agrees well enough with real blocking figures (obtained 
from full-scale simulation or measurement) for many engineering and 
design purposes. Accuracy can be improved by “calibrating” Lee’s 
results against real values where they are available. Furthermore, com- 
puted values lacking in absolute accuracy will reveal relative differences 
between networks and between various traffic conditions in the same 
network. 

The NEASIM program, to which the remainder of this paper is 
devoted, is basically designed to estimate the value of the blocking 
polynomial. This portion of its design and use is described in Sections 
IUI-V. The design also allows additional assumptions regarding the 
detailed traffic behavior of connection graph link states to be incor- 
porated in the graph model. When more a priori information is included, 


* That is, chosen to reflect the requirement that the input-output terminals 
j,k are idle by (usually) subtracting the load contributed by the terminals 7,k 
from the assumed carried link loads. 
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the validity of the simulation results improves as anticipated. This 
aspect of the NEASIM program is described in Section VI. 


III. THE NEASIM PROGRAM 


This part of the paper is devoted to a presentation of the program. 
The basic viewpoint or philosophy of the NEASIM approach is given 
in Section 3.1. A general description of the program is contained in 
Section 3.2. 


3.1 Philosophy of the Program 


We saw in the preceding section how the GRAPH model provides — 
in theory at least — a method for the analysis of a large class of switching 
networks and how the model is impractical as a tool for networks with 
complex geometrical structure. To evolve a practical tool, we shall 
change our method of approach. 

Suppose we are given a GRAPH G with branch occupancies 


Pi, °°' > Pm. 
Until now we were concerned with the explicit blocking polynomial 
Be = Beli, -+* , Pm). However, we can think of Be(y1, +++ , Dm) asa 


curve in Euclidean (m + 1)-space, and set as our goal a precise ap- 
proximation to this curve. This point of view immediately suggests the 
use of simulation techniques. 

Tor example, if all the branches in the GRAPH of Fig. 2(a) are busy 
with probability p, then Be = Be(p) is a curve in 2-space. If a com- 
puter simulation were to be performed to give an approximation to 
Be(p), then we would require a computer program containing an 
algorithm which, when repeated n times, will produce an estimate 
Be” (p) such that 


lim Ba” (p) = Be(p) 


N>o 


and for which we have confidence limits on the absolute error, 


| Be” (p) — Bap) |, 


for all n. 

Consider the eight branches of the GRAPH of Fig. 2(a). Although 
all of them are busy with probability p, at any one instant each of the 
branches is either busy or idle — and for this particular configuration 
of busy and idle branches there is or is not a path through the graph. 
Thus, in each repetition of the above mentioned algorithm, 
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(7) we assign busy-idle states to each of the eight branches in such 
a way that probability of busy in each case is p; 

(77) after the assignment has been made we determine whether or 
not there is a path through the graph for the given assignment. 
Let B;(p) represent the proportion of n repetitions of the algorithm 
when no path through the graph was found. We would then expect that 
lim Be” (p) = Be(p); that is, we would expect our estimator to 


converge to the true blocking curve. In general, if the GRAPH G has 
m different occupancies p1, °°: , Pm, the program should produce an 
estimator Bg'” (pi, +++, Pm) converging to Be(pi, «+ , Dm). 

The preceding heuristic remarks were intended to outline the essential 
approach taken by the NEASIM program. The remainder of Section 
III is devoted to a description of the program itself. Sections IV and V 
contain the arguments which show that the estimator indeed converges 
to the true blocking curve, and how confidence statements about the 
precision of the results are obtained. 


3.2 Program Description 


The following four sections describe the NEASIM program. Section 
3.2.1 takes a ‘‘macroscopic”’ point of view, beginning with an account of 
the input and then proceeding to give a broad outline of the program. 
Section 3.2.2 sketches the layout of data in the computer memory. 
Section 3.2.3 discusses the organization and operation of the NEASIM 
algorithm. The salient features of the program flow are shown in Fig. 
5. Storage requirements and execution speed are given in Section 3.2.4. 


3.2.1 Macroscopic Description 


NEASIM was written for the IBM 7090 computer. The input con- 
sists of punched cards which we categorize as Graph Definition Cards 
and Simulation Definition Cards. Since the notion of a probability- 
linear graph implies a geometrical configuration together with an oc- 
cupancy assignment on the branches, the computer must be supplied 
with both these types of information. The geometrical configuration is 
read into the computer via the Graph Definition Cards and the various 
occupancies are read in via the Simulation Definition Cards. 


Graph Definition Cards 


The information punched on these cards includes 
(1) the total number of nodes, 
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Fig. 5 — Flow diagram for the NEASIM program. 
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(2) the total number of branches, 

(3) the total number of branch occupancies, 

(4) the occupancy associated with each of the branches, and 
(5) the interconnection scheme between the various nodes. 


Simulation Definition Cards 


Suppose there are m different branch occupancies for the GRAPH 
in question. These m values are punched on a Simulation Definition 
Card. For every set of values of the occupancies pi, °°: , Pm — that 
is, for every point on the blocking curve Be(pi, -+* , Pm) — there is 
one Simulation Definition Card. 

The estimator Be” (pi, «++ , Pm) will converge to Be(pr, -** , Dm) 
as n — o. But the computer must be instructed as to when to terminate 
the run. Now, the NEASIM process is such that it is possible to request 
that the error | Be” — Be | lie within a given percentage of the true 
value B, . This “desired precision”’ information is also punched on the 
Simulation Definition Card. It may happen, however (as will be the 
case whenever Bg is very small) that, in order to obtain the requested 
percentage error, the total number of repetitions of the NEASIM 
algorithm will perforce be exceedingly large. Since a lengthy computer 
run is economically undesirable, and since a high degree of precision is 
usually not needed when the blocking probability is so very low, an 
upper limit on the number of repetitions, 7, is also supplied to the com- 
puter by being punched on the Simulation Definition Card. If, after a 
run has been made, a greater degree of precision is still needed, it is 
possible to “pick up where we left off’? and continue the simulation in 
another computer run. 


The Program 


The Graph Definition Cards are read into the computer first. With 
this information the program constructs, in effect, a map in the com- 
puter memory of the geometrical configuration of the GRAPH. More- 
over, the program associates with each branch an occupancy p; — whose 
value is as yet unspecified. The first Simulation Definition Card is then 
read in, and the program now assigns the appropriate values to 


pi, °**, Pm. Once this information is obtained, the program is ready 
to execute the NEASIM algorithm, which consists essentially of two 
parts: 


(2) a busy-idle assignment is made on all of the branches in accord- 
ance with the specified occupancies p1, --+ , Dm 3 
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(27) the presence or absence of a path is determined for the particular 
assignment. 

These two steps are repeated again and again until such time as the 
estimate Be’ (pi, ++: , Dm) has been found. 

The results for this point on the blocking curve are printed out, and 
the second Simulation Definition Card (if there is one) is read. The 
program then goes through the preceding steps for this second set of 
values of p1, --- , Pm until the estimate for this point on the blocking 
curve has been obtained. When all the Simulation Definition Cards 
have been processed, the program run ends. 


3.2.2 Memory Organization 


After the Graph Definition Cards have been read in, the program 
prepares five main tables as follows: 

(1) Node table 

(2) Node-Link table 

(3) Branch tables 

(4) Occupancy tables 

(5) Linkage table. 


The Node Table 


The size (i.e., the number of words) of the Node table is equal to the 
number of nodes in the GRAPH, there being a one-to-one correspond- 
ence between the words in this table and the nodes in the GRAPH. 
These words are used by the program to indicate, after a particular 
iteration of the NEASIM algorithm, whether or not there is a path 
from each particular node to the first node. 


The Node-Link Table 


The size of the Node-Link table is also equal to the total number of 
nodes in the GRAPH, with each word corresponding to a particular 
GRAPH node. For each node the table indicates 

(7) the number of branches leaving the node — connecting to nodes 
more distant from the origin, and 

(it) a reference to a section of the Linkage table where further infor- 
mation on each branch is stored. 

The Node-Link and Linkage tables together constitute the program’s 
map of the GRAPH geometry. The other tables provide storage for 
busy-idle indications and path information. 
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The Branch Tables 


There are as many Branch tables as there are different occupancies 
in the GRAPH. For occupancies p;, ++: , Pm there will be m Branch 
tables, which we denote by BRT), --- , BRT (m). The size of BRT «) 
equals the number of branches in the GRAPH which are busy with 
probability p;. There is a one-to-one correspondence between a particu- 
lar word in BRT, and a particular branch in the GRAPH. The associ- 
ation between the words in the Branch tables and the particular GRAPH 
branches is part of the information stored in the Linkage table. The 
Branch table words are used by the program to store the busy-idle state 
of every GRAPH branch on each iteration of the NEASIM algorithm. 
The total storage required for the Branch tables equals the number of 
branches in the GRAPH. 


The Occupancy Tables 


There are m Occupancy tables, OCT qa) , +--+ , OCT (my , corresponding 
respectively to BRT a), --- , BRT (m). The size of OCT) is an input 
parameter of the program chosen to be large compared with BRT,,) . 
Every bit in the table OCT(;) contains a binary one with probability 
p;. The Occupancy tables are used by the program to supply random 
busy-idle states for assignment to the branches of the GRAPH on each 
iteration of the NEASIM algorithm. 


The Linkage Table 


As previously mentioned, this table contains the detailed interconnec- 
tion information between the various nodes in the GRAPH. The size 
of the table is equal to the number of branches in the GRAPH. Each 
word in the table represents a branch, say 6;, in the GRAPH and 
contains 

(t) the address of a word in the Node table which corresponds to 
the node to which 0; leads, and 

(77) the address of a word in a Branch table which stores the current 
busy-idle state of the branch }; . 

Consider, for example, the GRAPH of Fig. 2(a) and suppose that 
the occupancy of branches b; and by is p: ; the occupancy of branches 
bs, bs, bs and bg is pe ; and the occupancy of branches b7 and bs is p; . 
The tables which the program would prepare are indicated in Fig. 6. 
The symbolic addresses, such as NW1, BRW8, etc., have been chosen 
for illustrative purposes only. 
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Fig. 6 — Computer memory layout for the GRAPH of Fig. 2 with occupancies 
pi, p2 and ps. 


3.2.3 Organization of the NEASIM Algorithm 


In a previous section it was mentioned that the NEASIM algorithm 
requires 

(t) an assignment of busy-idle states on the branches, and 

(2) a method of searching for the existence of a path. 
These tasks are performed by three program segments which we shall 
call 

(2) the PROBABILITY GENERATOR, 

(77) the BUSY-IDLE ASSIGNMENT, and 

(a) the MATCH routine. 
For the sake of program efficiency (speed) the task of assigning busy- 
idle states to the branches is divided into two parts. The function of 
the PROBABILITY GENERATOR is to generate tables of busy-idle 
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bits — the Occupancy tables. The BUSY-IDLE ASSIGNMENT rou- 
tine assigns busy-idle states to branches on each iteration of the algo- 
rithm. The function of the MATCH routine is to find whether or not 
there is a path, given a particular assignment of busy-idle states on the 
branches. Discussion of the internal logic of these programs will be 
deferred to Section IV. For the present we will be concerned only with 
their functions. 

After the Graph Definition Cards have been read, enough information 
is available for the program to reserve appropriate space for the Node, 
Node-Link, Branch, Occupancy and Linkage tables. Once space allot- 
ment has been made and the necessary entries filled into the Node-Link 
and Linkage tables, the program reads the first Simulation Definition 
Card. Among other parameters, this card specifies the m different branch 
occupancies desired. At this point the PROBABILITY GENERATOR 
fills in the Occupancy tables. When this routine has completed its task, 
each bit in OCT) ,2 = 1, ---+ , m, will contain a binary one with proba- 
bility p; and a zero with probability 1 — p;. (Recall that the NEASIM 
convention is that one represents busy and zero represents idle.) The con- 
tents of the Occupancy tables remain unaltered for the entire time that 
the program is seeking an estimate for one point on the blocking curve. 

The storage of many computers is organized into sequentially num- 
bered words, each of which consists of a fixed number of contiguous 
bits, and each of which is addressable by the stored program for logical 
and algebraic manipulation. The number of bits in an IBM 7090 word 
is 36, so that each word in the Occupancy tables represents 36 independ- 
ent busy-idle states — thus allowing for 36 independent repetitions of 
the NEASIM algorithm. 

After the Occupancy tables have been prepared, various counters are 
set to zero and the Node table is initialized. As the contents of the Node 
table indicate for each node the presence or absence of a path from the 
particular node to the origin, the program takes the attitude of the man 
from Missouri and assumes there is no path until one is proven to exist. 
Thus the words of the Node table are initially set to all 1’s except for 
the first node, which always contains zeros. 

At this point the program enters the BUSY-IDLE ASSIGNMENT 
routine. To assign busy-idle states to the branches, this routine steps 
through each of the Branch tables and for each word in BRT «), a 
word from OCT; is selected at random and its contents duplicated in 
the Branch table word. When BUSY-IDLE ASSIGNMENT has been 
completed, the program enters the MATCH routine. 

Using the linkage information stored in the Node-Link and Linkage 
tables, the MATCH routine performs its logic on the Branch and Node 
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tables to find whether or not there is a path through the graph for each 
of the 36 independent busy-idle configurations. The operation of this 
routine is analyzed in Section 4.2. 

When MATCH has completed its work, the number of 1’s in the word 
of the Node table corresponding to the terminal node in the GRAPH 
(NW6 in Fig. 6) will be the number of times there was no path through 
the graph in the 36 trials. This number of blocked attempts is noted and 
36 is entered into a ‘‘run-length” counter. The Node table is reinitialized, 
busy-idle states reassigned and the algorithm repeated again and again. 
The repetitions will terminate when one of two situations occurs at the 
end of the MATCH routine. Either 

(i) enough attempts have been scored to guarantee the precision 
called for on the Simulation Definition Card, or 

(iz) the maximum number of attempts specified on the card has been 
exceeded. 

When the repetitions terminate, the proportion of blocked attempts 
is calculated and the results are printed out. The next Simulation Defini- 
tion Card is read and the process starts over for the next point on the 
blocking curve. 


3.2.4 Slorage Requirements and Execution Speed 


The largest GRAPH that can be handled by the present version of 
NEASIM is determined by the core storage available in a particular 
computer. Total storage required is given by the expression 

P+2(N+ B) +0 
where: 

P is the number of storage locations required by the NEASIM 
program itself (about 2000 words), 
is the number of nodes in the GRAPH, 
is the number of branches in the GRAPH, and 
is the storage required for Occupancy tables — typically m X 1024 
where 1 S m S 8 is the number of occupancy tables. 

The NEASIM algorithm is designed for rapid execution on the IBM 
7090. Average speed depends principally on the size of the GRAPH. 
Typical speeds range from about 600 trials per second (550-branch 
GRAPH) to about 5000 trials per second (48-branch GRAPH). 


oy = 


IV. PROBABILITY GENERATOR AND MATCH ROUTINES 


The functions of the two routines called PROBABILITY GENERA- 
TOR and MATCH were mentioned in the previous section. The present 
section is concerned with the internal logic of these programs. 
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The algorithm used by PROBABILITY GENERATOR is due to 
W. C. Jones.” It is felt that this heretofore unpublished algorithm is of 
sufficiently widespread interest to be included in this paper. 

A word on notation: throughout Section IV we shall use the notation 
C[A] to mean “the contents of A,” where A represents some bit in the 
computer memory. (The reader is therefore cautioned to distinguish 
between “bit A” and its contents C[A].) 

Two computer instructions which will be referred to repeatedly are 
the logical (inclusive) “OR” and logical “AND” instructions. Instruct- 
ing the computer to perform an OR on two bits will guarantee that the 
result will be 1 if, and only if, either or both of the bits contain 1; while 
an AND yields 1 if, and only if, both bits contain 1. When we OR bit A 
to bit B, then we shall say that we ‘‘perform [A] OR [B];” when we AND 
bit A to bit B, then we say that we ‘‘perform [A] AND [B].” 


4.1 The PROBABILITY GENERATOR 


The purpose of PROBABILITY GENERATOR is to generate the 
occupancies p1,°*:,Pm- We mentioned in Section 3.2.2 that this 
subroutine will fill up OCT) (¢ = 1, --- ,m) with 36-bit words each 
of whose bits contains 1 with probability p;(z = 1,-+--,m). In the 
sequel we will focus our attention on (a representative) one of these 
36 bits — keeping in mind that the program is actually working on 
36 bits independently and in parallel. 

Suppose we wish to generate a random binary variable which takes 
the value 1 with probability p(0 < p < 1) and to place our result in 
bit X. The algorithm, when terminated, will give Pr{CLXY] = 1} = p. 

The first action taken by PROBABILITY GENERATOR is to ex- 
press p as a binary fraction to 10 places.* This fraction is then scanned 
from right to left until the first bit is found which contains a 1. The 
algorithm uses this binary fraction of n < 10 places determined by the 
scan. We can therefore, without loss of generality, express p as 


Dp = O.0ndn-1 +++ debi bi = 1502, °--, 6, = Oorl. 
A digit selected from a random binary number will be referred to as a 
‘random bit” in the following algorithm. In a random binary number, 
the value of each digit is 1 with probability 3. 
Algorithm: 
(7) Set 7 = 1. Generate a random bit, say r1, and store it in X. 
Thus CLX| = Pp 


* The number of places is arbitrary. Ten was chosen to make the round-off 
error smaller than 0.001, since occupancies are specified on the Simulation Defini- 
tion Cards to three decimal places. 
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(77) If 7 = n, go to step (viz). Otherwise, increase 7 by 1 and con- 
tinue. 

(217) Generate another random bit 7; and store it temporarily in some 
bit, say PR. Thus C[R] = 7;. 

(iv) If b; = 0, go to step (v). If b; = 1, go to step (v2). 
(v) Perform [R] AND [X]; store in X. Go to step (72). 
(v7) Perform [R] OR [X]; store in X. Go to step (72). 

(vit) Stop. 

We observe that step (77) is performed (iterated) exactly n times. 
Let P; be the probability that CLX] = 1 after the jth iteration. The 
assertion is that P, = p—i.e., after the algorithm is terminated, X 
will contain 1 with probability p. 


Proof of Algorithm: 


Consider two bits A and B, and let pa = Pr{C[A] = 1}, and pz, = 
Pr{C[B] = 1}. When the contents of A and B are independent, if we 
perform [A] AND [B], the probability of the result being 1 is 


Paps , 
while if we perform [A] OR [B], the probability of the result being 1 is 
Pa + Pa — PaPe- 

Now, Pr{r; = 1} = Prir; = Of = 2 7g =—1,+-*,n. 
Therefore if step (v) is executed, 

Pha = 3P; j= 1,---,n-1 (3) 
while if step (vz) is executed, 

Peo Say. pe lsh aed 

= 3P; + 3. _ 


But step (v) is executed only if b;41 = 0, and step (v7) is executed only 
if bj41 = 1. Hence (3) and (4) can be combined into 

Piz = 9P5 + dbinn oe ice melee 
Since P; = 3, it follows by induction that 


_ bn Dn-1 ees be bi | 
sim ya nae Mma a a 


and our assertion is proved. 
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4.2 The MATCH Routine 


The MATCH routine is entered by the program after the busy-idle 
states have been assigned to the branches. Its purpose is to find whether 
or not there is a path through the graph for any particular assignment. 
In the present section we first derive a recursive set-theoretic formula 
which may be used to determine whether there is a path. The remainder 
of the section shows how the recursion is carried out by the computer 
to produce an estimate which converges to the blocking probability of 
the GRAPH. 

In Section 2.2 we saw how the “‘path enumeration approach” could, 
in theory at least, be employed in determining the blocking probability. 
We again take the path enumeration approach, but from a slightly 
altered point of view: 

Instead of trying to find whether there is a path through the entire 
graph at one fell swoop (which amounts to finding whether there is a 
path from the first node to the last node), we shall try to find whether 
there is a path from the first node to each of the nodes in the GRAPH. 
While this approach may appear to inject an unnecessary complica- 
tion, we will see how, by stepping through the GRAPH in an orderly 
fashion, this approach lends itself naturally to computer programming. 
We begin by investigating, somewhat further, the geometrical structure 
of a GRAPH. 

Consider the general GRAPH (whose structure is shown schematically 
in Fig. 7) and suppose that there are a total of »v nodes. Fach GRAPH 
has a first node, Ni, a last node, N,, and several intermediate “stages” 
of nodes. The notion of ‘‘stage’”’ is made more precise by the following 
definition: a node N is said to be in stage s(s = 1, 2, ---) if, and only if, 
all paths from N,; to N contain no more than s — 1 branches, and there 
exists at least one path from N, to N which contains exactly s — 1 
branches. 

Any GRAPH will thus contain some number S 2 2 of stages, where 
the first and last (Sth stage) consist, respectively, of the single nodes 
N,and N,. Let , be the number of nodes in stage s,s = 1, --- , S (thus 
Mm = ng = 1); and define a quantity m, by 

Ms, = > Ni. 
i=1 
We choose to order the nodes of the GRAPH in the following manner: 


to each of the n, nodes in stage s (s = 2, --- , S) assign arbitrarily, but 
uniquely, one of the integers 


Mea + 1, mea + 2, -°+ , Msi + Ns s=2,---,8. 
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Fig. 7— A general GRAPH. 


The nodes of the GRAPH are hence totally ordered and may therefore 
be denoted by N;, wherez = 1, ---, »v. 

When N; and N; are connected by one or more parallel branches, 
where NV; , N; are arbitrary nodes, we say that, for ¢ < 7, N; is directly 
connected to N; through each of these branches and write NV; > N,. 
We shall suppose, without loss of generality, that if N; — N;, then 
there is only one connecting branch, b,’. (This restriction is assumed in 
order to avoid introducing yet another index, but will not affect the 
subsequent results. When a computer instruction involving b,’ is de- 
scribed, we shall understand that the computer is to execute this in- 
struction for all the b,’ for which N; > N;.) 

Let us now examine the graph, asking the question for every node 
N,;, “Is there an available path from N, to N; ?” Our objective is to 
answer the question for 7 = ». 

For each node N; (j > 1) consider the branches b,’ for which NV; > N;; . 
Clearly, for each of these branches, if there was no available path from 
N, to N; or there is no available path through the branch b,’, or both, 
then, and only then, will there be no available path from N, to N ; which 
passes through NV; . More precisely, let 

X; be the event: there is no available path from N, to N;, 
Y,’ be the event: there is no available path from N; to N; 
which passes through N;, 
B; be the event: branch b,’ is not available (busy). 
Then 
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Y/ = X,UB/ 

X,;= nN Yi= N (xX, UB). 
t<j t<j 
Nj>Nj NioNj 


Consider all the nodes N; such that N; — N,;. Suppose there are k; 


such nodes. Call them N;,, N;,, ++: , Nu;, wherets <  < +++ < %,. 
Next, define an event Z;‘” recursively by 
Z;5 = (X,, UB, YNZ? m=2,-°-,k; (5) 
ese 
I Sas v 
and 
Z;> = X,, UB,/ Ni, > N; (6) 
i are 2 


It then follows that 
ZS? aXe jo=z2er-yy (7) 
and in particular 
Zs? = xX, 


where X, is the event “there is no path through the graph.” The MATCH 
routine is based upon these formulas. 

We now focus our attention on (a representative) one bit in each 
word of the Node and Branch tables — again keeping in mind that the 
program is actually working on a full word of bits independently and in 
parallel. The program will thus execute 36 simultaneous iterations of 
the MATCH algorithm, which we now proceed to evolve. 

When MATCH is entered, the following state of affairs prevails: 

(4) To each node N; and to each branch b,’ there has been uniquely 
assigned one bit of computer memory, so that we can henceforth refer 
to these bits as bit N; and bit b,’. 

(17) Clb] = 1 with the appropriate occupancy p,? = Pr{branch 
b,’ is busy}. (These p;’ were called p; , -- + , pm in Sections II and III.) 

(7i7) The linkage information between the node bits N; and the 
branch bits b,’ is stored in the Node-Link and Linkage tables. 

It follows that representation of the event Z;°’ = X;, U B;,’ of for- 
mula (6) can be achieved in the computer by performing 


[N.,] OR [b:,'] 
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provided that we always sct C[Ni] = 0; and the event Z7;“? = X; of 
(7) can be represented by executing the following steps. 
(2) Perform [N;,] OR [b;,’]; store in N;. 
(77) Set m = 1. 
(27) If m = k; go to step (vz). Otherwise, continue. 
(iv) Increase m by 1. 
(v) Perform ({N;,,] OR [b;,,7]) AND [N,]; store in N;. Go to step 
(122). 
(vt) Stop. 
We observe that the algorithm may be condensed if we initially set 
CIN j] = 17 2 2): 
(7) Set m = 1. 
(iz) Perform ({N;,,] OR [b;,,’]) AND [N,]; store in N;. 
(zt) If m = k;, go to step (v). Otherwise continue. 
(zv) Increase m by 1. Go to step (77). 


(v) Stop. 
This initialization of the node bits — C[Mi] = 0, CIN) = 1 forj = 
2,3, --: , »— was mentioned in Section 3.2.3 and can be interpreted as 


meaning ‘‘there is always a path from N, to N; ; there is no path from 
N, to N; (j > 1) until proven otherwise.” 

Summarizing, the steps that the computer may take to represent the 
event Z,“” = X, are 

(M1) Set CIN, = 0. Set C[IN,] = 1,7 > 1. 

(M2) For each bit N;,7 = 2, ---, v, and in the order 


Nog Na, 8284 NG 
find all bits N; for which N; — N; and perform 


((N,] OR [b,7]) AND [N,,]; store in N;. 

Now, the statement ‘find all bits N; for which N; — N,’ implies 
that the input to the program is such that it specifies to the computer 
which nodes N; are directly connected to N;. It was felt that a more 
straightforward input format would be to specify to which nodes, N;, 
each node N; connects. With the linkage information stored in this 
fashion,* step (M2) may be replaced by the equivalent step 


(M2’) For each bit N;,7 = 1, --- , » — 1, and in the order 


Ni,N2, SON SN Gg 


* A careful perusal of Section 3.2.2 will show that this is indeed the way the 
linkage information is stored in the Node-Link and Linkage tables. 
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find all bits NV; for which VN; — N; and perform 

({NV;] OR [b,’]) AND [N,]; store in NV; . 
Steps (M1) and (M2’) constitute the MATCH algorithm for deter- 
mining whether there is a path through the graph. When the MATCH 
algorithm is terminated, C[N,] = 1 if, and only if, there is no path, so 
that Pr{C[N,] = 1} = Pr{no path}. 

Suppose the algorithm is iterated n times. In any one iteration the 
event C[b,’| = 1 is generated with probability p,/ and independently of 
the event C[b,’| = 1 in any other iteration. But Pr{C[N,] = 1} after a 
given iteration is clearly only a function of the probabilities Pr{C[b,] = 
1}. Hence, N, will contain 1 with the same probability, B, after each 
iteration, and we conclude that n iterations of the algorithm constitute 
a sequence of nm Bernoulli trials with probability B of success on each 
trial. Let £; be a random variable such that 


£; = 1 if C[N,] = 1 after the 7th trial 
0 if C[N,| = 0 after the 7th trial 


and let 


B,=& t&t+ vy ss Pee 
An application of the Law of Large Numbers (Ref. 29, p. 189) gives 
lim (B,/n) = B. 


Identifying B,/n with Be (p, «++, pm) of Section 3.1 and B with 
Be(pi, °++ , Pm) of the same section, this last result is equivalent to, 
and proves the assertion that, 

lim Be (pr yo cy Pm) — Be(pr 9700s Du): 


n>o 


Repeating the experiment often enough and dividing the number of 
times C[N,] = 1 (i.e., the number of blocked attempts) by the number 
of iterations n (1.e., the number of attempts) will therefore yield a 
precise estimate of the blocking probability of the GRAPH. How 
large n has to be in order to attain any given degree of precision is dis- 
cussed in the next section. 


V. RELIABILITY CONSIDERATIONS 


Since any simulation is nothing but an experimental measurement, 
and hence subject to statistical fluctuations, it is necessary to assess the 
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reliability of the results. Unfortunately, most of the testing for.a given 
simulation must be done on an a postertorz basis and there is, in general, 
rarely sufficient a priort information on which to base the decision of how 
long the run is to be. The need for such information becomes even more 
important when the cost factor in computer simulations is considered. 
It will be shown below that the NIZEASIM procedure is one which allows 
for a priort determination of run length. The decision of how long the 
run is to be is based on the desired precision and is made by the com- 


puter. 
In Section 4.2 we introduced the random variable B, , the number of 
times C[N,| = 1.in n repetitions of the experiment. B, will now be 


called ‘“‘the number of calls blocked in a simulation run of 7 calls.’’* 
The estimator B,/n was seen to converge to the blocking probability 
B, and is. indeed a maximum likelihood, mean-unbiased, minimum 
variance estimator. 

To generate the random binary numbers of Section 4.1, NEASIM 
uses the well-known multiplicative congruential method. This method 
for pseudo-random number generation has been discussed, tested, and 
used by numerous investigators’ since it was first proposed by 
Lehmer.*” Although the method has been demonstrated to generate 
35-bit random binary numbers, there is some measure of cyclic behavior 
in the low-order bits. NEASIM forms a word of 36 random bits by 
combining the most random halves (18 high-order bits) of two 35-bit 
words generated by this method. A further check on the randomness is 
provided by NEASIM itself, which, as part of its output, prints the 
generated branch occupancics. We therefore assume that the events 
C[N,| = 1 are independent for individual trials of the experiment. The 
results of any given computer run should thus be binomially distributed 
(see also Section 4.2), and hence asymptotically normal. 

As an illustration, we again turn to the GRAPH of Fig. 3. For a 
branch occupancy of p = po = 0.5, formula (2) yields B = 0.0525. 
The NEASIM program, for a run of n = 201,600 calls, gave B = 
0.0529 — an error within 1 per cent. (This run took some 40 seconds of 

* The new nomenclature is chosen to indicate a different interpretation of what 
NEASIM is doing: NEASIM looks at the configuration of possible paths through 
the network between two subscribers, takes ‘‘snapshots”’ of the current busy-idle 
states of the links in these paths, and then finds if there is a path for each snap- 
shot. Thus the program is essentially running calls through a portion of the 
network — the portion of the network being a representative one, and hence one 
from which significant statistical data can be extracted to describe the perform- 
ance of the entire network. A similar approach was taken by A. Feiner, W. C. 
Jones, and others,?° who wrote ‘‘abbreviated’’ simulations in which the busy- 


idle state data were taken from previously run full-scale simulations, but for 
which a new program had to be written for each new network to be simulated. - 
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computer time.) To study the normality, the number of blocked calls 
was noted for every 1008 calls run. Since B, is the number of blocked 
calls in a run of n calls, the binomial distribution function in this case is 


8 
Prip, =e} = os ou BY. — By 
k=0 U 


and the approximating normal distribution function (Ref. 29, p. 172) 
is (p41), where 


= 
I 


(t — 1008B)h 
[1008B(1 — B)y"? 


> 
I 


and 


1 mS oa 
d(x) = We [ ody 


is the standard normal distribution function. Fig. 8 is a plot on proba- 
bility paper of the cumulative frequency distribution determined by the 
computer, and the theoretical normal distribution line for B = 0.0525. 

In order to obtain meaningful results, it was felt that the run-length 
should be determined by the following criterion: the number of trials 
shall be large enough so as to give 95 per cent confidence that the esti- 
mator lies within a fixed percentage of the true value of B. Since the 
blocking probability is unknown at the outset, this criterion is more 
useful than requiring the estimator to lie in a fixed interval about B. 
Thus, we wish to choose n large enough so that 
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Fig. 8 — Cumulative distribution for GRAPH of Fig. 3 (p = po = 0.5) ina 
run of 201,600 calls in which the number of blocked calls was noted for every 
1,008 calls run. The points are experimental data; the line is the theoretical dis- 
tribution, b(xg43) 
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Bs _ pl = aph = 095 


where a is an input parameter to the program representing the desired 
percentage. Rearranging terms, we get 


r =a 1_B xm Ue 


VW/nB(1 — B) 

where, for n sufficiently large, the distribution of the random variable 
B, — nB 

/nB(1 — B) 


approaches the standard normal. Our requirement will therefore be 


> 9 


n 2 SS (2-1). (8) 


It follows that the number of trials should be increased as the blocking 
probability decreases in order to maintain 95 per cent confidence that 
the estimator lies within a fixed percentage of the true value —as 
intuition dictates. On the other hand, for a fixed B, as n increases, the 
95 per cent confidence limits will narrow. This is displayed in Fig. 9, 
where the program results for the GRAPH of Fig. 3, for B = 0.0525, 
are seen to lie within the confidence limits. 

Furthermore, since (1/B) — 1 S (1/B) for all B in the unit interval, 
the requirement will be met if 


3.84 1 
N22 B: 














or 


But for large n, B & (B,/n). Hence, making the substitution we obtain 


pease. 

= 
As long as the number of simulated calls is large enough so that the 
number of blocked calls is at least 3.84/a’, there is 95 per cent confidence 
that B,/n is within aB of B. 


NEASIM was written to accept several values of a ranging from 0.05 
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Fig. 9 — NEASIM blocking estimates for GRAPH of Fig. 3 vs run length 
(p = p2 = 0.5, B = 0.0525). 


to 1.00. I'rom the specified value, it determines the minimum number of 
blocked calls necessary to guarantee this precision. Fig. 10 shows the 
results of the simulation of the GRAPH of Fig. 3 for a = 0.10 and 0.50. 
In GRAPHs where B is very small, it is usually unnecessary to estimate 
B with a very high degree of precision — especially at the expense of 
costly computer runs. An upper limit for n is therefore also specified. 
The computer then proceeds with the simulation until it either exceeds 
the lower limit on B, or the upper limit on n. 
Tn the latter case, the reliability can be assessed as follows. Since 


mB(1— B) >= J = 


we can rearrange terms and obtain 
Pr{dB’ + eB + f S 0} = 0.95 


where 
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Fig. 10 — NEASIM blocking estimates for GRAPH of Fig. 3 vs branch oc- 
cupancy p = pe. 


d=n' + 3.84167 
e = —(2nB, + 3.8416 n) 
f = B,’. 


It is easily verified that for all n, B, > 0, the parabola dB’ + cB +f 
is concave upward, has real roots, and that B,/n lies between the roots 
whenever B, < n. Thus for any simulation run, the 95 per cent confi- 
dence interval can be obtained by solving for the roots. 

Now in all cases of interest, the product of the roots, 


f/d = B,’/(n’ + 3.8416 n), 


can. be closely approximated by (B,/n)’, so that B,/n can be taken as 
the geometric mean of the roots. Suppose the higher root is B, and the 
lower root B, ; then 
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B, = kB,/n 
B, = (1/k)B,/n. 


The roots were calculated over a range of values of B, and n, and the 
results are displayed in Fig. 11, where & is plotted as a function of B,/n 
for various values of n. As an example, suppose that in a run of 80,000 
calls, B, = 80 were found blocked. Then B,/n = 0.001 and Fig. 11 
gives k = 1.244. Hence B, = 1.244 (0.001) = 0.00124, B. = 0.001/ 
1.244 = 0.000803, and there is 95 per cent confidence that 0.000803 < 
B #S 0.001244. The importance of the case under consideration would 
then determine whether a longer simulation is necessary. 

Since the GRAPH geometry may be as complex as indicated in Fig. 4, 
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Fig. 11 — The 95 per cent confidence limits for B:(1/k)(B,/n) S B S k(B,/n). 
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it is important to be able to verify that no error was made in mapping 
the geometry into the computer memory. To this end, a program was 
prepared by Miss D. Logan which, using the Graph Definition Cards 
as input and an SC-4020 Microfilm Recorder, draws a picture showing 
the GRAPH geometry. 


VI. POSSIBILITIES FOR INCREASED REALISM 


In the preceding section it was shown that the NEASIM program 
produces precise, reliable results in comparison with Lee’s probability 
linear-graph model of switching networks. Unfortunately, in complex 
switching networks, substantial differences may exist between the 
estimates obtained with Lee’s analytical technique and actual perform- 
ance determined by field measurement or full-scale (complete) simula- 
tion. These differences appear to be largely attributable to the unreality 
of the assumption [(e) of Section 2.1] that the link random variables are 
independent. 

The dependence that exists between the different links in a network 
stage and between the links in adjacent stages is incompletely under- 
stood, but nonetheless real. Attempts to take account of interlink de- 
pendence by judicious modification of link occupancy values have been 
moderately successful in calculations and may, of course, be employed 
in NEASIM runs. However, the nature of the NEASIM process suggests 
alternate approaches which may ultimately prove to be more fruitful. 
While considerable success has been obtained with some of the tech- 
niques discussed in this section, much remains to be accomplished. 


6.1 Dependence Effects within a Switching Stage 


Two possibilities for increased realism within the confines of a single 
link stage appear worthy of mention. The NEASIM program typically 
assigns link stage busy-idle states from a binomial distribution. An 
obvious suggestion (but one upon which little work has been done) 
would be to modify the PROBABILITY GENERATOR and/or 
BUSY-IDLE ASSIGNMENT routines in such a way as to produce 
busy-idle state assignments taken from various distributions — Ja- 
cobaeus’ # distribution,‘:* for example. 

A second approach, which has been extensively used, is to incorporate 
program routines between BUSY-IDLE ASSIGNMENT and execution 
of the MATCH routine. These routines examine the random busy-idle 
assignments and make appropriate assignment changes where GRAPH 
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geometry or other considerations indicate. An example of such a routine 
is one which is designed to insure that there exists at least one “‘sure- 
idle” branch out of ann X n input switch. If calls are only placed be- 
tween idle network terminals, at least one branch out of ann X n input 
switch must be idle. But if the branches leaving the initial GRAPH 
node are specified at occupancy p, then the program would normally 
make all these branches busy with probability p”. The SURE-IDLE 
routine, upon finding such an assignment, will select an initial branch at 
random and make it idle. This action, of course, disturbs the otherwise 
binomial distribution of branch states and should be taken into account 
when branch occupancy values are specified. 


6.2 Interstage Dependence Effects 


An interesting technique for introducing interstage dependence exists 
within the NEASIM framework and is based upon an obvious extension 
of the SURE-IDLE routine just described. Briefly stated, a routine 
could be designed to examine the busy-idle assignments made on the 
input and output branches of each node of the GRAPH. Acting on 
knowledge of the switch geometry and other factors, the “‘dependencing”’ 
routine could change the initial busy-idle assignments where necessary 
to make them more realistic. Only a very simple and admittedly in- 
accurate routine has been used to date with, however, a remarkable 
increase in the “realism” of the results obtained. 

The simple-minded DEPENDENCING routine currently employed 
assumes that every GRAPH node is in reality an n X n switch. It 
observes that in an n X n switch each input branch has probability 1/n 
of being connected to any particular output branch. It attempts to 
implement its idea of reality by, for each input branch, examining each 
output branch and forcing the output branch state to agree with the 
input branch state with probability 1/n. While this routine can produce 
rather quaint effects, such as duplicating the state of one input branch 
on all output branches, it does possess the virtues of simplicity (rapid 
execution) and a basically correct notion of interstage dependence. 

That the use of the SURE-IDLE and DEPENDENCING routines 
can be effective is demonstrated in Fig. 12. The results shown in the 
figure are for a moderately complex eight-stage switching network whose 
GRAPH has 232 branches. NEASIM results with and without de- 
pendencing and sure-idles are compared with the results of a full-scale 
simulation. The improvement in realism possible with the rudimentary 
routines just described appears quite dramatic. 
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Fig. 12 — Comparison of NEASIM results (with and without dependencing 
and sure-idles) with full-scale simulation data for a realistically complex network. 


VII. CONCLUSION 


The NEASIM approach to switching network simulation has achieved 
its major goal, the mechanization of a widely employed — if somewhat 
unrealistic — technique for load-loss analysis. The applications of the 
GRAPH model need no longer be restricted by the overpowering com- 
putational difficulties brought on by GRAPH complexity. 

Furthermore, the simulation of an analytical model concept basic to 
NEASIM seems to open new areas for profitable exploration in the 
analysis of switching systems — and perhaps other stochastic systems 
as well. The success of the elementary realism-injecting routines suggests 
that further research along this line may be rewarding. 

Finally, the degree of realism in results attained so far, coupled with 
the ease of application, has produced what amounts to a new tool for 
use in both the design and engineering of new switching networks. It is 
now feasible to achieve relatively complete and accurate load-loss 
engineering data on complex switching systems well in advance of actual 
field experience. 
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Filling Factor and Isolator Performance of 
the Traveling-Wave Maser* 


By F. S. CHEN and W. J. TABOR 
(Manuscript received December 16, 1963) 


In designing a large gain and simultaneously large instantaneous band- 
width traveling-wave maser (TWM), the filling factor and the isolator per- 
formance should be optimized. 

The filling factor 1s a measure of the efficiency of interaction between the 
spin system of the maser material and the RF magnetic fields of the slow- 
wave structure. For the TWM using the 90° operation of ruby and the 
comb as the slow-wave structure, the c axis of the ruby should be parallel to 
the z axis of the structure (the direction of the signal wave propagation) 
for the largest filling factor. The improvement of the filling factor by the 
proper orientation of the c axis of the ruby is larger at the lower signal 
frequencies because the transition probability of ruby is more nearly linear 
at those frequencies. 

The isolator should provide sufficient reverse absorption to make the 
TWM short-circuit stable and yet add the minimum forward absorption to 
the TWM. Both the reverse and the forward absorption of the isolator de- 
pend critically on the size of the ferrite disks and the position in which they 
are wmbedded in the comb structure. 

An analysis of the filling factor and tsolator performance and its com- 
partson with measurements was made. Together with Refs. 1 and 2, 
this paper is intended to reduce the amount of experimental work involved 
in developing traveling-wave masers. Although the discussion 1s centered 
on the comb-type ruby TWM, the data provided also apply to other tape 
slow-wave structures using different active crystals. 


I. INTRODUCTION 
In order to make a large gain and simultaneously large instantaneous 
bandwidth traveling-wave maser (TWM), it is necessary to orient the 


* This work was supported in part by the U. 8. Army Signal Corps under 
Contract No. DA 36-039-SC-89169. 
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active material so that the susceptibility tensor has a maximum inter- 
action with the RF magnetic field of the slow-wave structure. Quantita- 
tively this is expressed as a filling factor, F, where:! 


F =F.F,. (1) 


F, is the volume filling factor and represents the fraction of the RF 
magnetic energy that is inside the active material. If all of the RI 
magnetic energy is within the boundaries of the active material, then 
this factor can have its maximum value of one. F,, is the phase filling 
factor and represents how well the RF field can couple to the suscep- 
tibility tensor of the active material. For example, if the active material 
is represented by a susceptibility with a transverse component of 
magnetization which is circularly polarized, and a structure with the 
RF magnetic field that is circular and in the same sense of rotation, 
then /’, is equal to one. It is clear therefore that the maximum value of 
the total filling factor F' is one. 

The present design for a TWM uses a comb structure that is loaded 
with active material on both sides and nearly fills all of the available 
space. There are, however, some regions that are not filled with active 
material; for example, the space occupied by the isolator, the space 
between the fingers, and the region near the wall of the waveguide 
housing that is important for the shaping of the w-8 response of the 
structure. However, these spaces are considered small, and therefore 
the volume filling factor is taken to be nearly one. Since the volume 
filling factor has nearly its maximum value, no further consideration 
will be made on F, ; instead, our attention will be directed to F, . 

Since the comb structure has almost no RFI fields in the direction of 
the fingers’ and the ruby has no susceptibility component along the de 
magnetic field when used in the 90° operation,’ it is clear that the de 
magnetic field must be placed parallel to the fingers of the comb for 
maximum efficiency. Therefore, the only degree of freedom for the c 
axis of the ruby is a rotation about an axis parallel to the fingers. In 
the comb structure the RF magnetic field changes with position and 
with 6, where @ is the phase difference between adjacent fingers of the 
structure and can have values from 0 to a over the passband of the 
structure. In addition, the susceptibility of the active material is a 
function of frequency. Hence F, varies in a complicated fashion over 
the passband of the structure. It is our intention to find an expression 
of F,, as a function of the various dimensions of the structure, the angle 
between the ¢ axis of the ruby and the z direction of the comb structure, 
and the phase difference 6. 
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The isolator incorporated in the comb structure consists of a linear 
array of ferrite disks arranged to have the same periodicity as the comb. 
The disks are so shaped that the ferrimagnetic resonance occurs at the 
same de field and frequency as that required by the ruby. Two require- 
ments of the isolator are: first, the isolator should provide reverse ab- 
sorption at least equal to the round-trip gain provided by the ruby and 
second, the ratio of the reverse to forward attenuation should be high. 
Our objective in this part of the analysis is to find the effect of the 
various structural parameters of the comb on the performance of the 
isolator and to provide a guide toward the optimum design of the iso- 
lator. It is found that the structure that provides maximum gain per- 
formance is not the same as that for optimum isolator performance. 
Therefore, the attitude has been to design the structure for maximum 
gain performance and then to optimize the isolator within the constraints 
imposed by this structure. 

In this paper, the attention will be centered on the comb as the slow- 
wave structure, a ferrimagnetic material such as YIG as the isolator, 
and ruby as the active material, where the latter is only considered in 
the 90° operation; i.e., the de magnetic field is perpendicular to the 
crystalline c axis of the ruby. However, the analysis applies equally well 
to other types of taped structures and active materials, since the results 
are expressed as a function of 6 and not of the frequency. The connection 
between the frequency and @ is made through the w-8 relations of the 
particular structure. 

The assumption is made that the RF field configuration of the struc- 
ture is unperturbed by the presence of the spin system of the ruby and 
the ferrite disks. This is justified, since the spins in ruby are very dilute 
and the ferrite disks are thin. 

Under the following two conditions: (7) no RF fields in the direction 
of the fingers and (77) the field on that part of the z axis between the 
adjacent fingers independent of z (uniform field assumption), the RF 
magnetic fields in region 2 (see Fig. 1) can be expressed as: 


H,=j pus A, cosh B,(D — x)e 7" cos 5 (2) 
H, = >> A, sinh B,(D — x)e**" cos se (3) 
Brl 

An = 


nig (4) 
u LB, sinh B,D 
2 


1008 THE BELL SYSTEM TECHNICAL JOURNAL, MAY 1964 


_ 9+ 2n0 


ee ee 
Ey = —2j F sing (6) 


where the voltage on the mth finger is taken as V, = Voe?”’, and h 
is the sum of the finger length and the correction due to the fringe 
capacitance at the finger tips. The rest of the notation is shown in Fig. 
1. Equations (2) and (3) also assume W = D, which is a good approxi- 
mation to the TWM with a large-gain, instantaneous-bandwidth 
product now in use. 

Neglecting the copper and dielectric losses of the structure, the 
microwave signal power in the TWM can be expressed as: 


| : of [ Hex): H* dv + | Hex." an} 
T m t 


P, = P; me he 
[un dv 


where P; and P, are the input and output power, respectively. Lr 
is the total length of the structure, v, is the signal group velocity and 
x1” and x” are the imaginary parts of the susceptibility tensor of the 
ruby and ferrite disks, respectively. The volume integral in the de- 


nominator with the subscript v encloses the whole volume of the struc- 
ture per period, and the volume integrals in the numerator with the 


Lidl ed) 
IN SS N F 





L 
Z=ML Z=(M4tNL 


(a) (b) 


Fig. 1 — Comb structure loaded with ruby. The ¢ axis of ruby makes an angle 
a with the z direction. 
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subscripts m and 7 enclose the volume of ruby or ferrite per period, 
respectively. 

By performing the integration in (7) using the RF fields expressed in 
(2) and (3), the filling factor and the isolator performance can be found. 
The filling factor will be derived in Section II and the isolator perform- 
ance will be discussed in Section III. In both sections the results of 
theory are compared with the measurements. In Section IV, concluding 
remarks will be made. 


II. FILLING FACTORS 


2.1 Analysis and Discussions 


The susceptibility tensor in (7) is a classical expression, whereas the 
properties of ruby are given in terms of the spin Hamiltonian; it is 
therefore necessary to establish a connection between xi” and the 
quantum mechanical properties of ruby. The subscript one will now be 
dropped from x,”. Throughout this paper MKS units will be used. 

Classically, the absorbed power is given by 


Pave = dys | Hex” -H* dy (8) 


where yo is the permeability of free space, w is the frequency in radians 
per second, and H is the RF magnetic field. The quantum mechanical 
expression for absorbed power is given by: 


Pars = i W2(p1 = p2) hw dv (9) 


where p; and p, are the densities of spins in levels 1 and 2 respectively, 
h is Planck’s constant divided by 27, and W,- is the transition proba- 
bility between the levels in question. W1-2 is in general a function of the 
position. 

Equating (8) and (9) and dividing through by the total RF energy, 
bof 1-H*dv, one obtains: 


[wx Hd 2 | Wrslor = pe)ti dv 
Ota asst Eat ee a ae sae Ps (10) 
[aH cP wo | HH is 


The left-hand side of (10) is exactly the expression occurring in (7). 
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One can further define: 
/ H-H* dv 


F, = 7 (11) 
[ wu ap 


/ H-y”-H* dv 


I 


Fy (12) 


(Xeinr” + Xytyi) | H-H* dv 


where F’, and F, are the volume and the phase filling factors, respec- 
tively, and X22”, Xyy” are the diagonal elements of the susceptibility 
tensor. x..-” is not present in (12), since this term is zero for the 90° 
operation. The prime in the subscripts of x” refers to the crystal axes 
of ruby. The z’ axis is taken parallel to the crystalline c axis. (see Fig. 
1). In general, both the RF magnetic field and the spin magnetic mo- 
ment have elliptic polarizations, and when these ellipses have the same 
sense of rotation, the same principal axes and the same shape (ie., 
He Xyy" = H, x22"), then F, as defined above has its maximum value 
of one. The actual value of F, for the comb structure will then be a 
measure of how the fields depart from this ideal case. 
Comparing (10), (11) and (12) one obtains: 


2 | W1-2(p1 — pa)h dv 
Fy = ——2.—__________. (13) 


bo (X2r2r” a Xy'y"”) / H-H* dv 


It is now necessary to calculate the detailed expression for W1.. . One 
starts with an unperturbed Hamiltonian 3C) and the time-dependent 
Schrodinger equation 


Roy = th(dp/dt) (14) 


whose solutions are ¥ exp — [(E,/n)t]. For this calculation the yy ei- 
genfunctions are those obtained from the spin Hamiltonian for ruby.° 
A perturbation term is now added which will represent the coupling of the 
spins in ruby to the RF magnetic field. If this perturbation is called 
5, , then the equation that must be solved is: 


(Hy + 3C:)U = th(OU/at) (15) 


where 
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KH, = u-H (16) 
u is the magnetic moment of the spin, and H is the RI’ magnetic field. 
u is given by: 

= 98S (17) 


where g is the spectroscopic splitting factor, 8 is the Bohr electronic 
magneton, and S is the spin angular momentum operator. 

The RF magnetic fields are given in (2) and (3). For convenience 
they can be expressed as: 


H, = cos an ss Ln (6,2) (- ‘) [exp t(wt + 8,2) ies 
— exp —t(wt + 6nz)] 
Hz = cos 2% >) M,,(6,c) (4) [exp (wt + Byz) 
2h “a (19) 
+ exp —t(wt + 6,2)] 


where 


L, = A, cosh B,(D — x) 
M, = A, sinh B,(D — 2). 
The coordinate system for the ruby and comb structure is shown in 


Fig. 1. For an angular separation of a between the z and 2’ axes, (16) 
becomes: 


KR, = gB[(S. sina + S, cosa)H, + (S» cosa — 8S, sina)H,}. (20) 


Equation (15) will be solved in terms of an infinite series of unper- 
turbed solutions yy , 1.e., 


U = d Cy(t) exp — [¢(B)/h)¢] (21) 


where the expansion coefficients depend on the time. | C(t) |’ is the 
probability of finding the spin in the state y at the time ¢. Initially the 
spin is assumed to be in the state ¥, i.e., | Ci(0) |* = 1, | C,(0) ? = 
0 for \ > 1. The perturbation is then turned on and a solution for 
| C2(t) |? is sought. | C2(t) |’ is the probability of finding the spin in 
state 2. For weak RF magnetic fields this term is equal to: 


| Co(t) P = Wrat. (22) 


The process of obtaining | C2(t) |? is a standard one in quantum 
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mechanics’ and will not be given in detail here. The result is 

Wis = GEE g(a — ox) (OnOm + PaPn) (28) 
where g(w — wo) is a normalized line shape function such that: 


i g(w — wo) dw = 1 


On = ((Sy)M, + (S2)Iin) COS a COS Brz 

— ((S2)M, + (S,)Ln) sin a sin Bp2 
((Si)Mn + (Sy)L,) sin a cos Bz 

+ ((8,)M, + (S2)Iin) Cos a sin B,2 


S 
3 
I 


(Se) = f wSevo* dv, (Sy) = f rSyrbe* do. 


Substituting (23) into (12), one obtains: 


g'8'rg(w — a0) (or — pe) | X (OnOn + PaPu) dv 
gt ae 
Tito (X2r2"” + Xyry”) [wu dv 


If we were to examine just one element of the x” tensor, say x22”, 
and subject it to a simple field, H, sin wl, then by going through the 
same procedure [using (13)—(22)] one could establish that: 

22 
xara” = FE glo — 0) (0 — 02) || Se 12) (25) 
and similarly for the other elements in the tensor. 

Using (25) to simplify (24), one obtains: 


/  (OnOm + PoPm) dv 
F, = - (26) 


((Sar)? + (S,-)”) if H-H* ai 


The phase filling factor will now be calculated for two cases: (1) a 
ruby TWM loaded on one side and (2) a ruby TWM loaded on both 
sides. In both cases it is assumed that the ruby entirely fills the gap 
between the fingers and the outer waveguide wall and extends to the 
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open tip of the fingers. In addition, it is assumed that, in the case of a 
TWM loaded with ruby on one side only, the opposite side is still 
filled with a material whose dielectric constant equals that of ruby. 
In practice this assumption is fulfilled very closely with recently de- 
veloped masers.’ By performing the integration in (26), one obtains 


(Se-)(Sy) 
Fy 3) — 3 Gee ADs Sp 3 
+ ert Gee 


[(S:-)’ — (Sy)’] cos 2 ee 
zs’) oo y’ COS 2a 1 
2[(Se-)” + (Sy) 2) 
for the structure filled on one half side, and 

Po) 2 [(Sz)" — (Sv)'] cos 2a > (1) (28) 


2[(Sar) + (Sy) 
for the completely filled structure. >> (4) and >> (1) are defined as 


follows: 
Oey Brl 
eae (*") 


amc 
2 





> @) = (29) 
sin” (%!) 
3 Ey 2 / cosh 8,D 
B,D () sinh 6,D 
2 
Ee sin” (2) 1 
2 B,D (Fy sinh? 8,,D 
ue - (30) 


2 sin” (2!) 
3 Eo 2 / cosh B,D 
B,D (F) sinh 8,D 
2 


The + sign of the second term in (27) determines the nonreciprocal 
behavior of the ruby TWM. In the direction of propagation where the 
elliptically polarized fields of the comb structure are in the same sense 
as the near circular transition of ruby, the plus sign applies. In the oppo- 
site direction, where the fields are not as well matched, the minus 
sign applies. 
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Fig. 3— 24) vs 6/m for 1/L = 0.25 and four values of the parameter D/L. 
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Fig. 5— 2X (1) vs 6/m for 1/L = 0.25 and four values of the parameter D/L. 
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MAGNITUDE OF THE MATRIX ELEMENT 








FREQUENCY IN GIGACYCLES PER SECOND 


Fig. 6 — A plot of (S.-) and (Sy) as functions of frequency. 


The >.’s are in general functions of the parameters 1/L, and D/L 
and 6, where the terms 1, L and D are defined in Fig. 1. )>(4) and >0(1) 
are plotted as a function of 6 in Figs. 2, 3, 4 and 5. Fig. 6 is a plot of 
the absolute values of the matrix elements, (S.-) and (S,-), as a function 
of frequency. The matrix elements are calculated between the levels 
— << —+4 for the 90° operation of ruby. With the data presented in 
Figs. 2 to 6 and (27) and (28), the phase filling factor for a ruby loaded 
comb structure can be closely estimated. 

Equation (28) shows that the phase fillmg factor for the completely 
filled structure is 3 if the transition probability of the ruby is circular 
(i.e., (Sz) = (S,)) regardless of the angle a, or, if a = 7/4 radians, 
regardless of the ratio (S,-)/(S,). This can be shown to be the con- 
sequence of the symmetry of the RF magnetic fields about the plane 
of the fingers. Equation (28) also shows that the filling factor does not 
depend on the sign of a, i.e., cos 2a is an even function of a. This 
result is a consequence of the symmetry of the RF fields about a plane 
parallel to the z-y plane and centered between the fingers. 
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The filling factor increases as D/L becomes smaller (see Fig. 3) if 
(S,’) is greater than (S,’). This is due to the fact that the RF magnetic 
fields become more linear as D/L gets smaller. . 

The ratio of the electronic gain of the TWM with both sides loaded 
with ruby to that with only one side loaded with ruby is shown as a 
function of @ for different frequencies in Fig. 7. The curves are obtained 
assuming D/E = 0.75, 1/L = 0.5 and F,, = 0.5 for a half loaded and 
F, = 1.0 for a fully loaded comb. Even at 10 ge, where the transition 
probability of the ruby is becoming fairly circular, a 30 per cent in- 
crease in gain can be obtained by loading the second side with ruby. 
As an example, let us consider a structure with 30 db electronic gain 
when the ruby is loaded on one side only. The round-trip gain would 
be 30 + 0.3(80) = 39 db, (assuming @ = 7/2). An isolator of at least 
39 db would be required to assure short-circuit stability, and since the 
reverse-to-forward loss ratio of an isolator is about 20, the forward 
loss of the isolator would be approximately 2 db. If the structure were 
now loaded on both sides, the forward gain would increase to 39 db 
and the round-trip gain would be 78 db. The forward loss of the isolator 


oo 


® 


oe 












































Ge (08) [LOADED ON BOTH SIDES] /Ge (08) [LOADED ON ONE SIDE] 
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Fig. 7 — The ratio of the electronic gain (in db) of a maser when loaded on 
both sides to that when loaded on one side. The structures are otherwise identical, 
with a = 0°, 1/L = 0.5, D/L = 0.75. 
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for this structure would be approximately 4 db. The net gain (excluding 
the common copper loss) is therefore 28 db for the single-sided loading 
and 35 db for the double-sided case. It is therefore clear that two-sided 
loading is more efficient at least up to X-band, although not by a very 
large margin. 


2.2 Experimental Verification 


Equation (27) was checked experimentally by measuring the para- 
magnetic absorption in a structure loaded on one side with ruby. The 
other side of the comb structure was loaded with alumina which has 
about the same dielectric constant as ruby. The measurements were 
performed in absorption so that complicating factors such as a micro- 
wave pump and isolator could be avoided. Equation (7), of course, 
defines the absorbed power as well as the gain of the TWM. The pass- 
band of the structure was centered at approximately 5.4 ge. Since 
(27) is a function of the angle between the c axis of the ruby and the 
z direction of the structure a, two different pieces of ruby were used: 
one with a = 67° and another with a = 20°. 

In the evaluation of I’, the volume filling factor, /’, , was taken to be 
exactly 4. All the measurements that enter into (7) were performed at 
4,2°K. v, was calculated from the w-8 response of the structure. The 
w-6 plot was obtained by a bridge technique in which the difference in 
path length of the two arms of the bridge was carefully accounted for 
and subtracted from the measurement. In this way, the correct w-8 
response of the comb structure was assured. 

The term (pi — pz) that enters into the susceptibility formula (25) 
is difficult to determine exactly, since it depends on the chemical analy- 
sis of the ruby. Since the major concern of this experiment is the filling 
factor and not the accuracy of the chemical analysis, the concentration 
was left as a parameter to be used as a best fit to the calculated value of 
F, . The assumed concentration was then compared with that obtained 
by chemical analysis.° 

Figs. 8 and 9 are plots of the experimentally measured and calculated 
values of F,. Table I compares the assumed concentration to that 
obtained by chemical analysis. 

Figs. 8 and 9 both show the same type of behavior. The ratio of ex- 
perimentally measured values to the calculated values decreases as 6 
increases. Part of this may be explained by the fact that the value of 
the volume filling factor, F, , was taken to be exactly 3 for all values of 
6. This is a good approximation when @ = 0°, since no RF fields exist 
between the fingers, but at large values of 6, RF fields do exist between 
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Fig. 8 — A comparison of the theoretical and experimental results for a ruby 
with a = 67°. The solid line is the theoretical curve and the points are the experi- 
mental measurements. 


fingers where there is no ruby, and therefore F, < 4. If Ff, becomes 
less than 4, the experimentally determined value of F/, would corre- 
spondingly increase and approach the calculated one. 

The assumed chromium concentrations compare very well with those 


determined by chemical analysis. In fact, in the case where a = 20° 






































Fig. 9 — A comparison of the theoretical and experimental results for a ruby 
with a = 20°. The solid line is the theoretical curve and the points are the ex- 
perimental measurements. 
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TABLE I — CoMPARISON OF ASSUMED CONCENTRATION TO 
CoNCENTRATION DETERMINED BY CHEMICAL ANALYSIS 


Assumed Conc. Chemical Analysis 

& (Cr/Al Atomic %) (Cr/Al Atomic %) 
67° 0.082 0.030 
20° 0.028 0.028 


the concentrations are the same and therefore Fig. 9 can be considered 
to be free of all adjustable parameters. However, the concentrations 
determined by chemical analysis cannot be considered highly accurate 
on an absolute basis, and therefore this agreement could be fortuitous. 


III, ISOLATOR PERFORMANCES 


The isolator consists of thin ferrite disks imbedded periodically in- 
side the ruby slab as shown in Fig. 10. The de magnetic field is applied 
in the y direction, and it is normal to the plane of the ferrite disks. The 
ferrite disks are usually thin and have square cross sections. The ratio 
of the side dimension to the thickness of the square (aspect ratio) is so 
adjusted that the ferrite resonates at the same de magnetic field as ruby 
for a given frequency. 

In Section 3.1, the analysis and the result of the machine computation 
will be presented. The calculation is compared with the measurements 
in Section 3.2. 


3.1 Analysts and Discussion 


Since the dielectric constant of the ferrite is about the same as the 
surrounding ruby. the ferrite disks are thin, and the plane of the pre- 
cessing spins coincides with the plane of the RF magnetic field before 
the introduction of the ferrite disks, then the disturbance of the RF 
fields due to the presence of the resonating ferrite disks should be small. 





Fig. 10 — Ferrite disk isolators imbedded in the comb structure. 
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Perturbation theory, which assumes the RI fields inside the ferrite to 
be the same as those present if the ferrite were replaced by the ruby, 
should be appropriate for the analysis here. 

We shall regard the ferrite square block as an approximation to an 
ellipsoid. Then the demagnetization factors N,, N, and N, in the 2, 
y and z directions, respectively, can be defined in the usual manner. 
In this approximation, the demagnetization factor of a square disk 
will be approximated by the demagnetization factor of a circular disk, 
ie., Nz = N,. Even if the disk is not square, VN, ~ N, & 0 if the disk 
is sufficiently thin. In suen cases, the imaginary part of the external 
susceptibility tensor x2” becomes diagonal when the RI fields are 
expressed in circular components. 

From (2) and (3), the positive and negative circular components 
of the RF magnetic field can be defined as 


1 
wos eee _ Bu(D—2) 5 IBZ ry 
H,= Vi (H, —jH.) = vi Ane cos (31) 


HM. = -. (H, + jH.) = AX Ay € PaO) 6 Hn2Z Gog TY (39) 
a! ae : as Dh 


Let x,” and x_” be the diagonal elements of the diagonalized x2”. 
They can be expressed as’ 


mT! 


x = joo? tT (33) 
Gul 
"= pa EE oe 
we = y[Ho + (Nz — N,)47rM] (35) 
Om = yarM (36) 
2 
/ ee A AS 2.8 me/oe (37) 


where 47M is the saturation magnetization, Hy is the externally ap- 
plied de magnetic field and AZ is the linewidth of the ferrite. In terms 
of the circular components the absorption due to the ferrite is given 
by [see (7)] 


ae { Ho Gps / H_H_* d 
sy. = . : - neper (88) 


4 [ Gt + HLH*) do 
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wLr x4” [ 1 dv + x” [ nae dv 


Ug 


Qa. = 


neper (39) 
| (He ae) as 


where a, and a_ are the reverse and the forward absorption of the 
isolator, respectively. For most of the ferrites used in TWM’s, x” < 
x,.”. Therefore, the terms containing x” can be neglected. Substituting 
(31) and (32) into (88) and (89) and performing the integrations, one 


obtains 
a, 20g (£) db (40) 
Ug \Qs 
where f is the frequency and Q. is defined as 
1/Q, = t/h [eos (ry/2h)P x” A (41) 
1/Q_ = t/h [eos (ry:/2h))? x4.” B. (42) 


A and B are ratios of the energy of the RIF magnetic field due to the 
positive and the negative circular components over the area occupied 
by the ferrite disk to the total RF magnetic field energy contained over 
the area in the x-z plane per period, respectively. They can be expressed 
as 


7 ae ee 

Be ae 2 sinh 26,,D 

- (Bn — Bn) (d — c) ae (Bm — Br)(d-+c) (48, 44) 
2 


0 re) sin 


KOS DS AA a 


M=—-00 N=—0O (Bin? Bar) LD 
X [exp (8m + Br) (D — a)\[1 ¥ exp F (8m + Bn) (b — a)] 


where the upper and the lower signs are for A and B respectively. The 
notations a, b, c and d are defined in Fig. 10. In (43) and (44), the RF 
field energy contained in the region between the fingers (region 1 in 
Fig. 1) is neglected, since this is usually small compared to the RF 
field energy contained in the ruby. 

Equations (43) and (44) were computed and the results are shown 
in Figs. 11-14. Note that A is a quantity proportional to the reverse 
absorption and A/B is the ratio of the reverse to the forward absorption 
of the isolators. Large A and A/B over a large range of @ are desirable 
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for the isolators. Since Figs. 11-14 are shown in terms of 6/7 rather 
than the frequency, the frequency-phase relation (w-6 diagram) of 
the structure has to be known in order to convert 6 into the frequency. 
All the dimensions involved were normalized by the length of a period 
of the structure L. In order to provide an easier understanding of the 
numerical data, we shall assume L = 0.08 inch (the commonly used 
size in our laboratory) and discuss the data with the parameters ex- 
pressed in inches. 

Since the isolation ratio becomes worse when the width of the wave- 
guide housing gets narrower, we shall study the structure with small D 
first (D/L = 0.75). In Fig. 11, A and A/B vs 6/r are shown when 
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Fig. 11— A and A/B vs 6/r for the following parameters: D/L = 0.75, 1/L = 
0.5; ferrite size, (0 — a)/L = (d — c)/L = 0.125 (0.01 X 0.01 inch); solid curves 
for (D — a)/L = 0.6875 (a = 0.005 inch); broken curves for (D — a)/L = 0.625 
(a = 0.01 inch). 


1024 THE BELL SYSTEM TECHNICAL JOURNAL, MAY 1964 





















































Fig. 12— A and A/B vs 6/x for the following parameters: D/L = 0.75, 1/L 
0.5; ferrite size, (0 — a)/L = (d — c)/L = 0.125 (0.01 X 0.01 inch); (d+ ¢)/L 
1.0 (ferrite on an axis midway between the adjacent fingers). 


ferrite disks of 0.01 X 0.01 inch are placed in different positions of the 
structure. Solid lines are for the case where one edge of the disk is 0.005 
inch away from the surface of the fingers, i.e., a = 0.005 inch (see Fig. 
10). If the center of the disk is exactly at the position midway between 
the adjacent fingers [((d + c)/L = 1.0], the isolation ratio A/B can be 
as large as 140 over a small range of @. If the disk is shifted in the z 
direction by 0.01 inch [dd + c)/L = 0.75], the isolation ratio decreases 
to values less than’ 20 while A remains almost the same. Next, let us 
take a = 0.01 inch (see the broken lines). At the position (d+ ¢)/L = 1.0, 
the isolation ratio again reaches over 140. Again, shifting the position 
in the z direction by 0.01 inch decreases the isolation ratio to about 20. 
One sees readily that the isolator disks have to be positioned carefully 
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at the position midway between the adjacent fingers. A small displace- 
ment from this position deteriorates the isolation ratio rapidly. This 
has been known experimentally for a long time, and indeed the position 
of the largest A/B for a given a dimension is found in practice by moving 
the bar imbedded periodically with the isolator disks until the minimum 
forward attenuation results. Hence we shall assume the center of the 
ferrite disk to be on the line midway between the adjacent finger (z = 
L/2) in all of the following discussions. 

Next, let us consider how the isolator performs as a is increased 
further. Fig. 12 shows A and A/B vs 6/z for 0.01 X 0.01 inch isolator 
disk as a is increased to 0.005, 0.01, 0.015, 0.02 and 0.03 inch [corre- 
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Fig. 18— A and A/B vs 6/7: D/L = 0.75, I/L = 0.5; ferrite size, (b — a)/L = 
(d — c)/L = 0.25 (0.02 X 0.02 inch). 
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Fig. 14 — A and A/B vs 6/m: ferrite size, (6 — a)/L = (d — c)/L = 0.875 (0.038 X 
0.03 inch); solid line, D/L = 0.75, (D — a)/L = 0.6875 (a = 0.005 inch); dashed 
line, le = 1.0, (D — a)/L = 6.8125 (a = 0.015 inch); peceliee dashed line, 
D/L = 1.0, (D — ()/L = 0.9375 (a = 0.005 inch); //L = 
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sponds to (D — a)/L = 0.6875, 0.625, 0.5625, 0.5, 0.375]. The maximum 
isolation ratio A/B is about the same from a = 0.005 to 0.015 inch 
and then it decreases for a larger a. This can be understood by noting 
that the RF magnetic field becomes linearly polarized at the wall of 
the waveguide housing, and hence A/B approaches one if the ferrite 
disk is placed near the wall. The value of @ at which the maximum 
A/B occurs gradually increases with the increase in a. If this were not 
so, a large value of A/B would still be maintained for ferrite disks of a 
larger cross section. Due to the spread of the peaks of A/B the ferrite 
disks of a larger cross section give small values of A/B. Larger values 
of a also reduce the reverse absorption (smaller A) since the RF energy 
is concentrated near the fingers. 

In practice, the isolator disks of 0.01 X 0.01 inch cross section are 
too small to fabricate and also too small to provide enough reverse 
absorption. In order to see to which direction the size of the ferrite 
disks should be enlarged, let us consider two rectangular disks of 0.01 X 
0.03 inch. One of them has a longer side parallel to the z axis and the 
other has a shorter side parallel to the z axis. Both disks have a = 0.005 
inch. Then one can show from Figs. 11 and 12 that for the former case 
A=9X 10°, B = 3.2 X 10° at 6 = 0.37 where A/B is maximum, 
and for the latter case A = 8.1 X 10°, B = 1.08 X 10 ‘at 6 = 0.357 
where A/B is maximum. Apparently the latter is the better way to 
place rectangular ferrite disks. 

Fig. 13 shows A and A/B vs 6/z for the ferrite disks of 0.02 X 0.02 
inch. The edge of the ferrite disks is placed a = 0.005, 0.01 and 0.015 
inch away from the surface of the fingers. As the ferrite disks are moved 
away from the surface of the fingers, the isolation ratio improves and 
then gets worse again. The reverse absorption steadily decreases at the 
same time. The sharp reduction of the isolation ratio by enlarging the 
size of the ferrite disks can be understood from the following considera- 
tions. At a given 6 (or frequency), part of the ferrite disk at the position 
where B is small contributes little to the forward absorption, while the 
rest, located at the position where B is large, absorbs much of the for- 
ward-wave energy. Hence the forward absorption of the whole ferrite 
disk is larger. In another words, the peaks of A/B vs 6/z curve occurs 
at a different 6/z for different portions of the ferrite disk (see Fig. 12) 
The ferrite disks of this size are still not large enough to provide suffi- 
cient isolation for the high electronic gain attainable in present TWM’s. 

Fig. 14 shows A and the isolation ratio A/B vs 6/a when the size of 
the ferrite is increased to 0.03 X 0.03 inch (see the solid lines). The 
constant A increases almost twice from the value for 0.02 inch square 
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ferrite. Assuming the thickness of the ferrite ¢ = 0.1 (d — c) (aspect 
ratio of 10), the reverse absorption of the isolator increases about 
2 X 1.5 by increasing the size of the disk from 0.02 inch square to 0.03 
inch square. However, the isolation ratio reduces from the maximum 
of 29.5 to 19.5 (the forward absorption increases 4.5 times). 

It is clear by now that the isolation ratio of the isolator improves 
rapidly as the size of the ferrite disks gets smaller. On the other hand, 
the size of the ferrite has to be large to provide enough reverse absorption. 
Thus in order to improve the performance of the isolator, one faces 
contradicting requirements. One approach to circumvent this difficulty 
is to look for a ferrite of much larger susceptibility. Another solution 
is to look for a proper dimension of the structure to provide a larger 
area of circular polarization of the RF magnetic field. In Fig. 14 A 
and A/B vs 0/m are shown with a larger spacing between the fingers 
(ferrite size is 0.03 inch square). It is increased from 1 = 0.04 to 0.05 
and to 0.06 inch while L is kept at 0.08 inch. The isolation ratio improves 
gradually. | 

Next consider the effect on the isolator performance when D/L is 
increased from 0.75 to 1.0. This will in general increase the group veloc- 
ity.” From (7) and (40), the net db gain of TWM G,, is 


1 ” I 
Gr ee (¥ x cao ) 45 
2 (FIx"| > (45) 
where I is the filling factor and | x” | is the susceptibility of the inverted 
spin system of the ruby. lor the TWM to be short-circuit stable, 


1/Q4 > 2F |x" |. (46) 


The isolator incorporated in the TWM should satisfy the condition (46) 
first, and then comes the consideration of how to reduce 1/Q_. The 
change in v, will change the net gain and the reverse and the forward 
isolator absorption but not the stability condition (46). Hence, we shall 
confine our discussion here to the effect of Q, as D/L is increased. 
Suppose the condition (46) is satisfied with the isolator of 0.03 X 0.03 X 
0.003 inch, a = 0.005 inch, //ZL = 0.5 and D/L = 0.75. If D/L is in- 
creased to 1.0, one sces from the irregular dashed curves in Fig. 14 
that A remains almost the same, while the ruby filling factor reduces 
(see Fig. 4). Thus the TWM is still stable. However, the isolation 
ratio A/B increases from 19.5 to 25 (ie., 1/Q- decreases). Therefore 
the isolator performs better with larger D/L. Unfortunately, D/L is 
usually made small to provide a small group velocity near the center 
of the passband instead of being sized for the consideration of the 
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optimum isolator performance. Even if the forward absorption of the 
isolator increases with a smaller D/L, the net gain of the TWM may 
increase. 


3.2 Comparison of the Calculation with Afeasurements 


Isolators of 0.02 X 0.02 X 0.002 and 0.03 X 0.03 X 0.003 inch 
polycrystalline YIG disks were made and imbedded inside the comb 
structure of the dimensions shown in Fig. 15. The forward and the 


@4. IN DECIBELS 























Q_ IN DECIBELS 








0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 
O/T 
5.3 5.4 5.5 5.6 5.7 


FREQUENCY IN KILOMEGACYCLES PER SECOND 


Fig. 15 — Measured and calculated forward and reverse losses. Ferrite disks 
0.02 X 0.02 X 0.002 inch (71 pieces). L = 0.08 inch, D = 0.06 inch, 1 = 0.04 inch, 
a = 0.016 inch, AH = 223 oe, 4rM = 1750 oe. Solid lines are the calculated ax. 
and the circled points are measured. Broken line is the calculated a,/a_ and 
the triangles are measured points. 
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reverse absorption were measured and shown as discrete points in 
Figs. 15, 16 and 17. The polycrystalline YIG has 47M = 1750 oe at 
room temperature. Its linewidth AH was determined by measuring the 
de magnetic fields at which the reverse isolator loss in db is one-half 
of the maximum. It is more than twice the usual linewidth. This is 
presumably due to the variation of the aspect ratio among the YIG 
disks and also partly due to the shape of the ferrite disks being square 
instead of ellipsoidal. The finger length of the structure was 0.198 inch 
but in the calculation of a4 it was assumed that h = 0.22 inch to correct 
for the fringe capacitance at the finger tips. 

The w-6 relation of the comb structure was measured by a phase 
bridge technique described in the previous section. This makes it possible 
to relate w to 6 and also gives the group velocity. 
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Fig. 16 — Measured and calculated forward and reverse absorption. Ferrite 
size: 0.02 X 0.02 X 0.002 inch (71 pieces). L = 0.08 inch, D = 0.06 inch, J = 0.04 
inch, a = 0.01 inch, AH = 223 oe, 47M = 1750 oe. Solid lines are the calculated 
as. and the circled points are measured. Broken line is the calculated a,/a— and 
the triangles are measured points, 
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Fig. 17 — Measured and calculated forward and reverse absorption. Ferrite 
disks = 0.03 X 0.03 X 0.002 inch (85 pieces). LZ = 0.08 inch, D = 0.06 inch, 1 = 
0.04 inch, a = 0.009 inch, AH = 251 oe, 47/1 = 1750 oe. Solid lines are the cal- 
culated a, and the circled points are measured. Broken line is the calculated 
a,/a— and the triangles are measured points. 


Figs. 15 and 16 show the calculated and the measured data with 
0.02 X 0.02 X 0.002 inch ferrite disks. Fig. 15 is for the case a = 0.016 
inch and Fig. 16 is for a = 0.01 inch. The curves of Fig. 13 show that, 
by decreasing a from 0.016 to 0.01 inch, the reverse absorption increases 
more strongly at large 6 than at small 6, and that the maximum value 
of a,/a_ occurs at a smaller 6. These theoretical predictions are con- 
firmed by the measurements, as can be observed from the data in Figs. 
15 and 16. 

The measured and the calculated ay and a;/a_ for the ferrite size 
of 0.03 X 0.03 X 0.002 inch are shown in Fig. 17. a, for the 71 pieces 
of ferrite disks was too large to be measured in our equipment. There- 
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fore every other ferrite disk was removed to reduce ax by half. Com- 
paring the measured data shown in Figs. 16 and 17, one notices that, 
by increasing the size of the ferrite disk from 0.02 to 0.03 inch square, 
the reverse absorption a+ increases about 1.6 times while the maximum 
isolation ratio a,/a— decreases by a factor of 2.4. These are approxi- 
mately the values expected from the calculation. 

The measurements agree well with the calculation in general. It should 
be mentioned that in calculating a,, use was made of parameters 
which can be measured only within an accuracy of 10 per cent. 


IV. CONCLUSIONS 


It has been shown that the filling factor increases as the c axis of the 
ruby approaches the z axis of the structure for the usual 90° operation 
in ruby. The increase is more significant near the 6 = 0 end of the 
passband of the comb structure than near the 6 = a end. For the for- 
ward-wave structure (df/d@ > 0), this corresponds to larger increase 
of the filling factor near the lower cutoff frequency than near the upper 
cutoff frequency. The increase in I’, as the c axis of the ruby becomes 
parallel to the z axis of the structure is more pronounced at lower 
frequencies, since the transition probability of the spin is more linear 
at those frequencies. 

The ratio of the reverse-to-forward absorption of the isolator in- 
corporated in the TWM can be over a hundred if the size of the ferrite 
disks is very small compared to the width of the waveguide housing, 
2D + d, and the size of the fingers. At present, the isolation ratio is 
far from the best due to the necessity of using large-size ferrite disks 
to provide enough reverse absorption. The size of the ferrite disks can 
be kept small and yet provide a large reverse absorption if the imaginary 
part of the susceptibility of the ferrite, x”, can be increased. Since at 
resonance x” = 47M /AH, one should look for a material with a large 
47M and a smaller AH. The saturation magnetization, however, can- 
not be made arbitrarily large, since the de magnetic field, which is 
fixed by the ruby resonance, must be greater than 47M in order to 
saturate the isolator disks. AH of the ferrite should be about 20 oe, 
which is approximately the linewidth of ruby. AH of polycrystalline 
YIG at the temperature of 4.2°K is over 200 oe. This can be reduced 
to a certain extent by minimizing the scattering in the aspect ratios of 
the ferrite disks and the use of round disks instead of square disks, 
since the internal field of a round disk is more uniform than that of a 
square one. A single-crystal YIG with AH enlarged to about 20 oe 
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has good possibility as an improved isolator material for TWM ap- 
plication. 

The analysis also shows that the isolation ratio gradually improves 
and then gets worse, while the reverse absorption steadily decreases, 
as the ferrite disks are moved away from the surface of the fingers 
toward the waveguide wall. Also, the optimum isolation ratio gradually 
shifts to a higher value of @. 

In order to provide enough reverse absorption, more than one iso- 
lator deck is usually stacked on top of another. It may seem as if the 
forward insertion loss of the isolator can be kept small over a wider 
range of 6 if a composite isolator with different a’s is used instead of 
one with the same a. However, by adding the curves of different a’s 
in Fig. 12 one will find that this is not so. 

The analysis and the curves provided in this paper enable one to 
estimate the increase in electronic gain obtained by a proper orientation 
of the c axis of the ruby and by the use of ruby slabs on both sides of 
the comb. The size of the ferrite disks necessary to provide sufficient 
isolation and the dependence of the isolator performance on the position 
of the ferrite disks are also discussed. Together with the Refs. 1 and 2, 
this paper constitutes part of an effort to reduce the experimental work 
involved in developing traveling-wave masers. 
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The Comb-Type Slow-Wave Structure For 
TWM Applications* 


By F. S. CHEN 
(Manuscript received December 5, 1963) 


The space harmonic analysis of the dielectrically loaded comb structure 
as used in traveling-wave masers (TWM) is presented. The frequency-phase 
characteristics (the w-8 diagrams) are computed by regarding each finger 
of the comb structure as a capacitive loaded transmission line. The rmpedance 
of the line ts based on the space harmonic analysis. Computed data are found 
to be in agreement with experimental results and, in particular, tt is con- 
jirmed that the w-B relation depends very critically on certain dimensions 
of the dielectric loading. The results of the analysis are used to derive pre- 
scriptions for the design of dielectrically loaded TWM comb structures, 
especially of structures with low group velocity which are suitable to pro- 
vide simultaneously large gain and large instantaneous bandwidth. 


I. INTRODUCTION 


The comb-type structure has been used successfully as a slow-wave 
structure for traveling-wave masers (TWM).! In a TWM the small- 
signal gain in db is inversely proportional to the group velocity. Gener- 
ally the gain obtainable from present maser materials is small. Therefore, 
a great deal of effort in developing a TWM is concerned with deriving a 
comb structure design with small group velocity at the frequency of 
interest. Both the group velocity and the passband of the comb structure 
can be found from the w-8 diagram. The shape of the w-6 diagram of the 
comb structure loaded with ‘“masing”’ crystal depends very critically on 
the various dimensions of the structure as well as of the erystal. In this 
paper, an analysis of the comb structure will be presented. It should 
serve as a guide for a reasonably accurate determination of the dimen- 
sions of the comb structure, of the active maser material and of other 
dielectrics which give rise to a required w-8 diagram. Some additional 


* This work was supported in part by the U.S. Army Signal Corps under Con- 
tract No. DA 36-039-SC-89169. 
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experimentation may be needed in practice for small corrections of the 
resulting w-8 characteristic. 

Various tape structures with the tapes perpendicular to the direction 
of signal propagation have been proposed and analyzed??4»5 in the past. 
The space harmonic analysis originally introduced by Fletcher? was 
used to obtain the w-8 diagrams of these tape structures. It assumes no 
RF field components in the direction of the tape (TEM wave approxima- 
tion); that is, the tape can be regarded as a transmission line supporting 
the TEM wave in the direction transverse to the direction of signal 
propagation. An impedance matching condition for the TEM lines in the 
transverse plane can be derived (transverse resonance). The resulting 
equation implicitly contains the w-8 relation. The authors mentioned 
treated only the case where the structure is immersed in a uniform di- 
electric, that is, mostly vacuum. For the TWM application, the structure 
is alway partially loaded with dielectric and the TEM approximation no 
longer holds. However, when the structure is nearly filled with dielectric, 
as in TWM’s for a large gain and simultaneously large instantancous 
bandwidths,® one finds that the TEM approximation can also be success- 
fully used to calculate the w-@ diagram of the dielectric-loaded tape 
structures. An “effective dielectric constant”’ is then defined to take into 
account the fact that the structure is only partially loaded with dielec- 
tric. This approach was used in earlier calculations of the upper and 
lower cutoff frequencies of the comb structure by Harris, DeGrasse and 
Schulz-DuBois.’ The analysis to be presented here extends their work to 
cover the entire w-G diagram. 

The w-8 diagram of the comb structure! and the ‘‘Karp structure’’’ for 
TWM applications had also been discussed previously, using the 
equivalent circuit method. However, the field analysis to be presented 
here gives a more detailed understanding of the structure. For instance, 
the filling factor of the active crystal and the performance of the isolator 
embedded in the structure are readily obtained by this analysis. 

In the next three sections, the w-8 diagram of the comb structure, the 
impedance of the tapes (or fingers) and the effective dielectric constant 
will be derived. This is followed by a more detailed discussion of various 
properties of the comb structure, including the techniques available for 
reducing the bandwidth of the passband, higher-order transmission 
bands of the structure and practical design considerations. The calcula- 
tions are usually compared with experiments, and they are found to be 
in good agreement. 

The impedance of the finger and the effective dielectric constant are 
defined as a function of 6, the phase angle between adjacent fingers. The 
phase angle 6, or equivalently the phase propagation constant 8, are 
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related to the frequency w by the w-8 relation which is obtained from the 
transverse resonance matching condition. Only the ruby-loaded comb 
structure is treated here up to and including numerical details. It is 
easily possible, however, to extend the same type of analysis to other 
types of tape structures. This is particularly easy since both the im- 
pedance and the effective dielectric constant are evaluated here as 
functions of 6 and not of w. 


Il. THE w-8 DIAGRAM 


Let us consider first a comb structure without dielectric loading 
(empty comb). The open end of the fingers has fringing electric fields 
terminating at the surrounding conductors. The effect of these fields can 
be expressed by a capacitance C' (I*ig. 1) which in general is a function 
of 6, C = C(@), where 6 is the phase angle between the adjacent fingers 
and takes values from zero to 7 radians over the passband. It will be 
assumed that the RI electric and the magnetic fields vanish in the 
direction of fingers (y direction) except at the finger tip. Then the finger 
can be regarded as a TEM transmission line with a characteristic im- 
pedance K and supporting a wave propagation in the y direction with 
velocity c, the velocity of light in free space. K is a function of 6, K = 
K(@). At the finger tip (y = h), the impedance looking in +y and —y 
directions must have the same magnitude and the opposite sign; there- 
fore there exists a matching condition 


wh 
30) = K(6) tan —. (1) 


According to this equation, the grounded finger presents an inductive 
reactance at the finger tip. The length of the finger h is therefore limited 
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Fig. 1 — Cross sections of the comb structure. 
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(m — 1)m < wh/e < (m — §)r, ae ee (2) 


For a given 6, one can find the frequency from (1), provided the func- 
tions K(@) and C(@) are known, and in this way the w-@ diagram (or 
w-6 diagram) may be derived. For m = 1, the finger acts essentially as 
a quarter-wave resonator and (1) gives the lowest passband for a given 
geometry of the comb structure. For m = 2, the finger behaves as a 
three-quarter wave resonator and the next higher passband appears. 
Thus the comb structure provides a series of passbands separated by 
stop bands. 

When the comb structure is partially loaded with dielectric as in Fig. 
1, there appear components of RF fields in the y direction, and the finger 
no longer behaves as a TEM line. However, when the structure is al- 
most completely loaded with the dielectric, the fields are again approxi- 
mately TEM waves in the y direction. We will adopt this TEM approxi- 
mation for the loaded structure and modify the impedance K and the 
propagation constant w/c of (1) by a factor ~/e(@). €(@) is called the 
effective dielectric constant and it will be defined more rigorously in Sec- 
tion IV. Then the dispersion equation for the loaded comb can be ex- 
pressed as 


1 K(6) wh ~/€(0) 
ss FS = tan ——*—* 
wC' (8) / (6) c 
for the case where the structure is loaded with dielectric of uniform thick- 
ness from y = 0 to y = fas in Fig. 1. For simplicity K(@), C(@) and 
e(@) will be abbreviated as K, C and ¢ from here on. The TEM approxi- 
mation offers a further advantage, since it enables one to analyze the 
various ‘finger tip loadings” in a simple way. 

It turns out that, in practical realizations of the comb structure, the 
fringe capacity C is always small, so that wh+/e/c is close to (m — 4)x 
with m = 1, 2, 3, --- . Hence (3) can be simplified by defining a quan- 
tity Ah through 


(3) 


woVelh + Ah)/c = (m — 3)r. (4) 
Substituting (4) into (3) and using the fact that Ah < h, one obtains 
Ah = (KCc/e) (5) 


where it should be noted that Ah is also a function of 6. Equation (4) 
becomes 


Cee 1) d= ve(n + XO) (6) 
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where \ = c/f is the free-space wavelength. For the empty comb, the 
w-8 diagram can be obtained from (6) by letting « = 1. Equation (6) 
states that, in the passbands, the finger length h plus the correction due 
to the fringe capacitance at the finger tip should be (2m — 1)\/4 when 
measured with the scale ~/. . 


III. IMPEDANCE OF THE FINGER, K (6) 


The admittance of a single finger as a TEM line is a somewhat ab- 
stractly defined quantity. One considers transmission lines having the 
cross section shown in Fig. 1(b), but without dielectric and infinitely long 
in the +y directions, or alternatively suitably terminated. On this set of 
identical transmission lines one considers waves of equal amplitude 
traveling, for example, in the +y direction and phased by 6 between 
adjacent lines in the +2 direction. Under these conditions, the admit- 
tance of a single finger is defined as the ratio of current to voltage on a 
finger and is a function of 6. 

Consider a comb structure as in Fig. 1, but without the dielectric and 
with the structure divided into three regions as shown. Then the current 
on a finger is the sum of the current on the surface of the finger in region 
1 (between the fingers) and the current on the surface of the finger in 
regions 2 and 3. Thus the admittance of the finger is the sum of two 
admittances: 


Y(6) = Yi(6) + Y2(@) (7) 


where Y1(6) is the admittance due to the current on the surface of the 
finger in the region 1 and Y2(@) is the admittance due to the current on 
the surface of the finger in the regions 2 and 3. 

The current on the finger can be found by a line integral of the RF 
magnetic fields, which in turn can be found by matching boundary con- 
ditions in the x-z plane. 

The potential on the mth finger may be written as 


Vn = Ve?” (8) 


where @ is again the phase angle between the adjacent fingers and may 
vary from 0 to z. Then, as will be shown in Appendix A, the potential 
along the z axis between the mth and the (m + 1)th finger can be ex- 
pressed as 

0 


V@z) = Voc" Ee cos 5 — f(z) sin H| (9) 
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where g(z) and f(z) specify the symmetric and the antisymmetric part 
of the potential distribution respectively. g(z) is also the potential dis- 
tribution when @ = QO, and f(z) is also the potential distribution when 
§ = x. It should be added that the representation of the 6-dependent 
potential distribution by a combination of 6-independent symmetric 
and antisymmetric functions, g(z) and f(z), is at best a good approxi- 
mation or, to use a more appropriate term, a guess. The representation 
is strictly correct only for 6 = 0 and 6 = zw. However, it is bound to be 
a good approximation near 6 = 0 and near @ = x where one of the func- 
tions dominates. The calculations show that the detailed shape of the 
potential distribution assumed is of little influence in the midband re- 
gion near @ = 7/2 and in fact up to @ = -. It is in this sense that the 
expression into a symmetric and an antisymmetric part, which is a power- 
ful tool in other electromagnetic problems, is justified here. 
The RF electric field on the z axis becomes 


E, = 0, i ene pees 
2 2 
(on the fingers) 
(10) 
xy, = —9Y Grp Sie Ge pie 
; dz’ : 2 ° 2 


(between fingers). 


The #, field in region 2, 7’. , may be expressed by a space harmonic or 
generalized Fourier sum 


Ex = >> F, sinh 8,(W — x)e ™e™ (11) 
where 6,L = 6 + 2nz, W is the height of the waveguide as shown in 
Tig. 1, and F,, is the amplitude of the nth space harmonic component. 
Each term of the sum is the electric field of a TEM solutionto Maxwell’s 
equations, and the z dependence, exp (—j6,z), assumes periodicity of 
the resulting field pattern from finger to finger as required by (8). The 
as yet unknown amplitude coefficients Ff, are determined by letting 
x = 0 in (11) and equating the resulting expression with the field on the 
z axis (10). Thus, 

Vo ItDO —p (mt4) E+ (112) 


7 7 opf os 6 76. Zz 
sepa ae ate = oo = nd 12 
L sinh BW J nt4yn—(1/2) E oe 2 yom | : ee) 


where g’ = dg/dz and f’ = df/dz. The current on the surface of the mth 


PF, = 
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finger facing regions 2 and 3 can be found from the line integral of 
the z component of the RF magnetic field there, and finally Y2. becomes 


sin Bn(L - l) 





Y2(6) = Dy fe a > (—1)" 


ip Ey coth BraW 


2 (13) 
g pitt (L+1)/2 
* | eos -- g’ sin Bnz dz + sin = i f cos Brz a] 
2 J(r—1)2 2 J (1-1/2 


where Yo = 377 mho. 
In the case of constant-field approximation, as has been assumed by 
Fletcher,” 


g=1 
f = (22/l) — (L/l). 


The integration in (13) can be readily performed. For the case where 
L—1= L/2 andd = 1 (square finger cross section), 


(14) 











sin Bnb\" 
70 7 2sins (217 BL coth 6,1. (15) 
a 


Y,(6@) becomes, in the same constant-field approximation 


Y,(6) d . 28 
wo 4. L sin’ 5: 
The impedance of a finger K(0) = (Y,(6) + Y.(@))~ in the constant- 
field approximation is plotted in Fig. 2 in dashed lines as a function of 
6/m for W/L = 1.25 and 0.75. 

The constant-field assumption does not take into account the singu- 
larity in the field at the corner of the finger. It will be shown later that 
the calculated w-8 diagram using the impedance thus obtained disagrees 
severely with the measured one for 6 < 0.57. Harris et al.’ used the con- 
stant-field approximation for region 1 only and assumed the field pro- 
duced by an infinitely thin tape for regions 2 and 3. This will be referred 
to as the thin-tape approximation. The calculation used conformal map- 
ping and it is applicable directly only for 6 = 0, 7/2 and z. The thin-tape 
approximation assumes a 180° singularity at the corner of the finger and 
thus exaggerates the actual 90° singularity there. A further difficulty in 





(16) 
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— —— FROM THIN TAPE APPROXIMATION 


—---—FROM CONSTANT FIELD 
APPROXIMATION 











K(@) IN OHMS 
































Fig. 2— The impedance of the finger vs 0. It is assumed that L -l =l=d. 


this approach is that the fields on the boundaries between the regions 
2,3 and the region 1 are not matched. 

A third alternative would be to base the calculations on experimental 
data. The potential distribution functions g(z) and f(z) in (9) may be 
measured directly with a large scale two-finger model in an electrolytic 
tank. This is probably the most reliable method, although data are ob- 
tained with only limited accuracy and only numerically. In addition, 
the data are applicable directly with good accuracy only for @ = 0, =. 

The second (thin-tape) approximation will be used here, and it will 
be extended to cover the whole range of 6. With the thin-tape approxi- 
mation, the functions g and f, which specify the potential distribution 
along the z-axis, for 6 = O and z respectively, can be found by Schwarz- 
Christoffel transformation. They have a quite complicated form involv- 
ing elliptic functions, and therefore the integrals in (13) are difficult to 
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evaluate. Fortunately, at 6 = 0, 7/2 and z, K(6) can be found directly 
by Schwarz-Christoffel transformations without resort to (13). One can 
arrive at a good approximation of Y».(@) for the whole passband of the 
structure from (13) and the knowledge of Y2 at 6 = 0, 7/2 and z with- 
out evaluating the integral. The procedure is described in Appendix B, 
and the result is shown in Fig. 2 with solid lines for various values of 
W/L. Both the constant-field and the thin-tape approximations give 
about the same K(@) for 6 = 7/2 but there is a large difference near 

= 0. It will be shown later that K(@) resulting from the thin-tape ap- 
proximation yields a reasonably good agreement with experimental data. 


IV. THE EFFECTIVE DIELECTRIC CONSTANT €(@) 


If the comb is not entirely immersed in an isotropic medium, some 
RF field components appear in the direction of the fingers. The TEM 
assumption which has been used in Section IT no longer holds. However, 
if the medium is only slightly nonuniform, the TEM assumption is still a 
good approximation. Fortunately, TWM’s designed for high gain and 
large instantaneous bandwidth are so heavily loaded with an active 
crystal that the TEM approximation is reasonably valid and may be 
used in the structure analysis. In this section, an effective dielectric 
constant is defined. It is a function of 6 — that is, of the details of the 
RF electric field configuration. At a particular value of 6, the effective 
dielectric constant of e(@) is defined as the dielectric constant of a uniform 
medium filling the same comb structure, which results in the same total 
charge per comb finger as that produced by the true, incomplete diclec- 
tric loading (this definition is meaningful only for heavy dielectric loading 
such that the TEM approximation holds). It is obvious that this quantity 
e(9) is a very helpful one for the analysis. 

Referring to Fig. 1, a slab of dielectric of thickness D is placed in the 
regions 2 and 3. Its dielectric constant is assumed to be isotropic and 
equal to e. TEM-type solutions of Maxwell’s equations are assumed in 
regions 2 and 3, both inside and outside the dielectric slab. The com- 
ponents of the field vary as 


exp [6,7 — gky i JBnz] 


where k is the plane wave propagation constant for the respective media. 
Boundary conditions have to be matched at x = 0, D and W in the y-z 
plane. The constant-field approximation expressed in (14) is explicitly 
used for the boundary condition at 2 = 0. 




















Ot 0.2 03 0.4 0.5 0.6 0.7 08 09 1.0 


Fig. 3 — Square root of the effective dielectric constant vs 6. It is assumed that 
es ee a 
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Then the charge per unit length on a finger is given by 


(L-1)/2 ~d 
Q) = 2 i Eedz + 2 [ Kade (17) 
— (L—1)/2 “0 
where E. is the x component of the electric field in region 2 at x = 0 
and FE. is the z component of the electric field in region 1. An effective 
dielectric constant ¢«(6) is now defined by requiring the same amount of 
charge Q to exist on the finger as if the whole structure were immersed 
in a medium of dielectric constant e(@). The resulting formula can be 
given for combs with rectangular fingers; however, for simplicity, only 
the result for equally spaced square fingers (L — 1 = d = L/2) is given 

here 














Pee oe 
ae _fsinsp 
ee (—1) aL 
4 
1 + €coth 6,(W — D) coth buP) 18 
G + ecoth B,(W — D) tanh 6,D nD Se 
e(8) = 2 
. Bria 
ae _ [sin 
sin 5 -|- 5 2 (—1) BL coth 6,W 
4 


The result of machine computations of e(6) using (18) and assuming a 
dielectric constant of the loading dielectric of ¢ = 9 (approximately the 
value of ruby) is shown in Fig. 3(a), (b), and (c). To facilitate other 
computations which will be discussed later, the square root ~/e(6) is 
shown in these graphs rather than ¢«(6). For a fairly complete loading 
(D/W > 0.8), «(@) approaches 5 near 6 = zw. Near 6 = 0, e(6) varies 
widely, depending on D/W. This can be understood by noting that near 
6 = 7 the RF fields concentrate near the fingers and hence e(@) is little 
changed by a change in the width of the air gap near the waveguide wall. 
On the other hand, near 6 = 0 more of the fields reach the waveguide 
wall. Thus the width of the air gap there affects the magnitude of (0) 
more drastically than that of e(7). 


V. PROPERTIES OF THE COMB STRUCTURE 


With the knowledge of K(@), C(@) and «(@), one can calculate the 
w-8 diagram of both the empty and the loaded comb structure. The 
calculation reveals a number of interesting properties of the comb struc- 
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ture and also suggests various techniques for narrowing the passband 
of the structure. 


5.1 Empty Comb Structure 


As a maser structure, the comb is always loaded with one or more 
dielectrics. However, the study of the empty comb is of some interest to 
us, since it offers the possibility of checking the accuracy of our im- 
pedance calculations by measurements. 

The fringe capacitance C'(@) for 6 = 0, and 6 = z has been measured 
in the electrolytic tank. Values for C(0) and C(7) are shown in Fig. 4. 
They were obtained by resistance measurements on a large scale model 
of a comb finger in a tank. The values are plotted versus the distance 
between the finger tips and the opposite waveguide wall, s. The data are 
valid for fingers of square cross section, L/2 * L/2 = 0.040 X 0.040 inch, 
spaced center-to-center by LZ = 0.08 inch and contained in a housing of 
width 2W + L/2 = 0.240 inch (aspect ratio W/L = 1.25). It should be 
mentioned here that these data can be applied to dimensions other than 
those indicated if one observes two facts. First, if all linear dimensions 
are scaled simultaneously by some factor, the capacity is scaled by the 





C (ar) AND C(O) IN MICROMICROFARADS 





SIN MILS 


Fig. 4 — The fringe capacitance at 6 = 0 and 6 = =z vs the spacing between the 
fey tip and the waveguide wall for the case where L — 1 = 1 = d = 0.040 inch. 
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same factor. Second, experience has shown that the fringe capacity is a 
very slow function of the ratio W/L; no noticeable errors were found 
when these capacity values were used for W/L values ranging from 0.75 
to 1.5. Unfortunately, C(@) for @ other than 0, 7/2 and 7 cannot be 
measured in a simple tank model. However, some indication of how C(@) 
changes with @ may be obtained experimentally. One would start with a 
measured dispersion curve of an empty comb; one would assume that the 
values K(@) in Fig. 2 for the thin-tape approximation are sufficiently 
accurate; then (1) offers a possibility of evaluating experimental values 
of C(@). In practice it turns out, however, that this approach does not 
yield values C'(6) of sufficient accuracy to determine the exact shape of 
the C(6) function. 

In Fig. 5, calculated dispersion curves of several empty comb struc- 
tures with dimensions as shown are given in solid lines. The calculation 
is based on K(@) of Fig. 2 using the thin-tape approximation. C(@) is 
assumed to change linearly with 6 from 6 = 0 to 6 = z. Measured points 
of the dispersion curves are also shown in Fig. 5. The measurements and 
calculation agree well. 

One case of a dispersion curve calculated by using K(@) from the 
constant-field approximation is also shown in Fig. 5 by the dashed line. 
It deviates considerably from the measured points for @ < 0.5 7m. 

One may conclude that both the impedance of a finger calculated by 
using the thin-tape approximation and the assumption that C(6) changes 
linearly with @ are sufficiently accurate for the present analysis. 

The passband of the empty structure can be narrowed by reducing the 
width of the waveguide housing 2W + d. Then the impedance values 
K(@) at @ = w approach each other more closely and so do the Ah 
values at 6 = 0 and 6 = zw. From (6), one readily sees that a narrower 
passband results. 


5.2 Loaded Comb Structure 


In this section, the discussion is restricted to the case of dielectric 
loading on both sides of the comb. Both loading slabs are parallelepipeds 
of equal thickness D. Both slabs cover the full finger height from the 
root to the tip, i.e., the height h of the fingers is also that of the loading 
slab. In addition, a comb of equally spaced square cross section fingers 
(L —1l =1 = d) is assumed. 

Since (8) is only an approximation and also since the dielectric con- 
stant of ruby is neither a scalar nor exactly 9, one cannot expect to 
obtain a close quantitative agreement between the measured and the 
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SOLID LINES ARE CALCULATED 


BROKEN LINE IS CALCULATED USING K(@) 
FROM THE CONSTANT FIELD APPROXIMATION 
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Fig. 5 — Calculated and the measured w-8 diagram of the empty comb. L — 1 = 
l = d = 0.040 inch. 


calculated w-8 diagrams. However, the effects of various loading dimen- 
sions on the w-6 diagram are correctly predicted by the theory, and the 
present analysis provides a reliable basis for an initial choice of design 
parameters. 

It will be convenient to define f(a) and f(0) as the frequency at which 
6 = x and 0, respectively. For a forward-wave structure (df/d@ > 0), 
f(r) > f(0), and for the backward-wave structure (df/d@ < 0) f(r) < 


(0). 
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In Fig. 6, calculated dispersion curves are shown for W/L = 1.25, 
K(ax)C(a)e/h = 0.05 and various values of D/W. The frequency scale 
is normalized to fo(), where fo() is the frequency of the empty comb 
(D/W = 0) at @ = zw. Without loading (D/W = 0), the structure is a 
forward-wave structure. Relatively thin slabs of ruby loading (see the 
curve for D/W = 0.2) make it a backward-wave structure of a compara- 
tively wide bandwidth. This can be understood by observing that the 
RF fields near 6 = z are more concentrated near the fingers than the 
RF fields near 6 = 0. Hence, the RF fields at 6 = a see more of the pres- 
ence of the thin ruby slab than the fields at 6 = 0. In this way f() is 
reduced while f(0) remains essentially unchanged. Further increases in 
the width of the loading (see the curves for D/W = 0.4 and 0.6) reduce 
the bandwidth of the backward-wave structure. This happens because 
the RF fields near 6 = 0 begin to interact with the dielectric slab, while 
the fields near 6 = 7 are almost completely contained in the initial thin 
slabs. This again changes the frequencies at 6 = 7 and 0 to a different 
extent and thus reduces f(0) more than f(z). 

Ata still greater dielectric slab thickness (see the curve for D/W = 
0.9) the structure is forward with a fairly narrow band and finally, with 
complete loading (see the curve for D/W = 1.0), it is forward with a 














Ses 
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Fig. 6 — Calculated w-8 diagram as the thickness of the dielectric is changed. 
It is assumed that L — 1 = 1 = d, W/L = 1.25, K(x)C(x)c/h = 0.05, and e = 9. 
The frequency scale is normalized to fo(z), the frequency at @ = = for the empty 
comb, 
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somewhat wider band. The explanation is again based on the fact that 
increasing the dielectric width reduces the lower cutoff frequency while 
leaving the upper one the same. 

The most important aspect of this behavior is seen by comparing the 
curves for D/W = 0.6, 0.8, and 0.9 in Fig. 6. It is apparent there that 
the transition between backward and forward structure does not involve 
backward-wave structures of gradually decreasing bandwidth, and then 
forward-wave structures of initially very narrow and eventually wider 
bandwidths. If this were so, it would be very easy to design comb struc- 
tures with extremely high slowing of the group velocity. Instead, it is 
seen that the transition from backward to forward-wave structures 
takes place via intermediate structures showing ‘‘fold-over.” By this, 
we mean a structure which for some range of @ is forward, whereas it is 
backward in the remainder of the @ range. It has been pointed out? that 
such a situation leads to instability and oscillations in a traveling-wave 
maser amplifier in spite of the incorporated isolator. Thus, for all 
practical applications, the occurrence of fold-over has to be avoided. 

It may be added here that dispersion curves of combs with other di- 
mensions vary in a similar fashion as the thickness of the dielectric slab 
is varied. It is of interest, however, to find out how the onset of fold-over 
is related to the comb geometry, i.e., to the ratio W/L, and to the finger 
end capacity. Here it is particularly desirable to have analytical data 
which indicate what choice of the W/Z and D/W ratios and of the finger 
end capacity will result in the greatest slowing of the group velocity 
near the center portion of the passband, but still avoid fold-over. For this 
purpose, a number of dispersion curves normalized to f(a) were calcu- 
lated and are shown in Fig. 7. One notices that fold-over takes place 
rather abruptly for D/W < 0.9 for all cases. In addition, one sees that 
the minimum group velocity [a(df/d@)| attainable near the center of the 
passband without fold-over decreases by reducing W/L and Ah()/h. 
The group velocity near the center of the passband is more or less ar- 
bitrarily defined as 


_ 7 2(f loo.7r — Ff leno.ar) 
ena | eee oe 


It is shown as a slowing factor S = c/v, in Fig. 8 vs W/L for D/W = 0.9 
and for different values of Ah(a)/h. The largest slowing is obtained for 
the smallest W/L. It should be pointed out, however, that our present 
calculations were not carried out for W/Z ratios of 0.5 or smaller, al- 
though such values would increase the slowing still further. Values of 
W/L < 0.5 appear unsuitable for practical maser designs for various 
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Fig. 7 — w-6 diagram for different W/L and Ah(x)/h. D/W = 0.9, L—lL=l=d. 
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Fig. 8 — Slowing factor vs W/L. Measured points are also shown. The measure- 
ments are made with structures having Ah/h = 0.02, A(x) = 9.9 cm, and L = 0.08 
inch. 
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reasons, among them the difficulty of incorporating an isolator into a 
comb structure with such dimensions. Larger slowing results also from 
reducing Ah(a)/h. Ah() can be varied over a narrow range by varying 
C, which is a function of s, the distance between the finger tip and the 
opposite waveguide wall. For the comb geometry used for measuring the 
fringe capacity, C(a) does not appreciably decrease for s beyond 
s = 0.040 inch. More generally, this would be done if s exceeds a value 
comparable to the finger “diameter.” This therefore sets a minimum for 
Ah(r)/h at a given operating frequency. 

To illustrate the significance of Fig. 8, consider two TWM7’s A and B 
with exactly the same dimensions except that the finger length h of A 
is twice that of B. This difference in h makes the operating frequency of A 
about one-half of that of B. For the time being, assume also that the 
magnitude of the magnetic Q of the active crystal is independent of fre- 
quency. With the values W/L = 1.0 and Ah(r)/h = 0.02 for A, Fig. 8 
shows that SL/\(r) = 3.2. For B, Ah(w)/h = 0.04, so that SL/\(r) = 
2.5. The db gain of a TWM is proportional to fS/Qm , which in turn, for 
constant Q», is proportional to SL/d(a). Thus one should expect that 
the db gain of maser A is larger than that of maser B by a factor 1.28 
(= 3.2/2.5). In practice, however, | Q., | usually increases toward lower 
frequencies, so that the gain of the lower-frequency maser A tends to be 
lower. 

It should be added that the slowing itself, S, is inversely proportional 
to a scale factor L/\(a) where \(7) = c/f(a) is the free-space wave- 
length of f(a). Thus for combs with a given period length DL, the slowing 
is greater for lower frequencies. It follows that the slowing factor S is a 
meaningful parameter in comparing different comb structures only if 
they have essentially the same period length, Z, and operating frequency 
range, f. 

Consider another hypothetical case. Suppose the fringe capacity 
vanishes, C(m) = C(0) = 0, so that Ah = 0. Experimentally, this situa- 
tion could be realized by a \/2 ladder structure where fingers of twice 
the comb structure finger length are anchored at both ends in a wave- 
guide enclosure. Without dielectric loading, this case would be that of the 
Easitron structure (see Ref. 7) which is characterized by a zero pass- 
band. With dielectric loading, however, a finite passband results never- 
theless. This is due to the variation of e(@) with 6, and in particular the 
difference between e(7) and e(0). Under these circumstances one might 
expect that the bandwidth is smaller and hence the slowing greater than 
in the case of a structure with finite fringe capacity and finite Ah. This 
is indeed the case. There is no curve shown in Fig. 8 for the parameter 
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Ah = 0, but it is obvious that this curve would lie above that for 
Ah(r)/h = 0.02. 

Measurements of dispersion curves of several comb structures loaded 
with alumina (e = 9.3) were made and they are shown in Fig. 9. Those 
parts of the dispersion curves where @ = 7/2 depend very little on W/L 
and D/W. The fold-over takes place at D/W between 0.85 and 0.90. 


The maximum slowing factor near the center of the passband which is 
attainable without fold-over increases with smaller V/Z. These general 
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Fig. 9 — Measured w-8 diagrams of loaded comb structure. 2 —-1=l=d= 
0.040 inch. 
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features agree with the results of the calculation. There is a deviation 
of typically about 4 per cent near @ = a, and of about 10 per cent near 

= 0 between the absolute values of measured and calculated frequen- 
cies. These discrepancies are not too disturbing, however, since (3) is 
only an approximation; the fringe capacitance data derived for an empty 
comb change when the comb is loaded up to the tip of the fingers, and the 
dielectric constant of alumina is not exactly 9 as used in the calculations. 
The slowing factors calculated from these measurements are also shown 
in Tig. 8 as circled points. The measured slowing factors are about 30 
per cent smaller than those calculated, but the dependence of S on W/L 
is correctly predicted by the theory. 


VI. TECHNIQUES FOR REDUCING THE STRUCTURE BANDWIDTH 


Since a structure with smaller bandwidth gives a larger slowing factor 
and thus a larger maser gain, the bandwidth of the structure should be 
kept as small as possible. It is necessary, of course, that the structure 
bandwidth exceed the required instantaneous or tunable design band- 
width of the maser amplifier by some reasonable, safe margin. This 
restriction was not important until recently. In earlier phases of travel- 
ing-wave maser development, it was difficult to design comb structures 
for sufficiently high slowing without running into the fold-over condition. 
More recently several techniques were developed which make it possible 
to design loaded comb structures with almost arbitrarily narrow band- 
widths, down to structure bandwidths of only twice the instantaneous 
amplifier response.° These techniques were derived both by experimenta- 
tion!° and by the theoretical considerations reported in this paper. They 
include the following: (¢) Near the tip of the fingers, the ruby may be 
shaped as in Fig. 10(a) or (b) by a step or bevel undercut. (zz) A strip of 
dielectric material of a high dielectric constant may be added next to 
the finger tips as in Fig. 10(c). (ziz) The thickness of the comb fingers, d, 
may be reduced (see Fig. 1). These three techniques may be applied 
either independently or together to reduce the bandwidth of the struc- 
ture. 

By shaping the dielectric near the finger tip as in Fig. 10(a) and (b), 
the effective dielectric constant «(@) is reduced. The reduction is not 
uniform across the band, but e(@) is more drastically decreased for 6 near 
0. Thus, for a forward-wave structure where the lower cutoff frequency 
occurs at @ = 0, the lower cutoff frequency increases without much 
change to the upper cutoff frequency. The bevel shape of Fig. 10(b) can 
be considered as a series of small steps, as indicated by the dashed line. 
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Fig. 10 — Techniques of ‘‘shaping’’ and ‘‘K34 loading”’ near the finger tip to 
reduce the bandwidth. 


The dielectric shapes of Fig. 10(a) and (b) are approximately equiv- 
alent in their role of narrowing the bandwidth. 

The finger tip loading may take the form shown in Fig. 10(c). A thin 
slab of dielectric material with high dielectric constant is imbedded in 
alumina or ruby near the finger tip. In our experiments, a ceramic with 

= 34 manufactured by American Lava Corporation was used. This 
material will be referred to as 84. This type of finger tip loading in- 
creases the effective dielectric constant ¢(@) near @ = 7m faster than near 

= (. For the forward-wave structure, this means that the upper cutoff 
frequency can be decreased faster than the lower cutoff frequency, and 
thus a narrowing of the bandwidth results. 

Let h’ be the length of the stepped dielectric (Fig. 10a) or K34 (Fig. 
10c), C’ be the capacitance looking toward the finger tip at the plane 
A-A, é’ be the effective dielectric constant of the section h’, and D’ be 
the thickness dimension as shown. The capacitive impedance looking 
toward the right side at the plane A-A, 1/wC’, can be found by regarding 
the section h’ as a TEM transmission line with the characteristic im- 
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pedance K(0)/~/e’(@) and a propagation constant wv/e'(@)/c and 
terminated by C at the end. If the electrical length of the modified sec- 
tion of finger line (which has the physical length h’) is small compared to 
a quarter wavelength, 


tan eV EO) h! Dy ee) h’, 


and if the fringe capacity loading at the finger tip is small, 
1 
iy K(@) 
wT “A / (0) 
then the effect of the stepped dielectric can be expressed by an effective 
capacity C’ 





C's Cll + (eh'/KCc)] (20) 


which effectively terminates the regular TEM finger transmission lines 
of length h-h’. From (6) and (20) one obtains the following formula, 
which contains implicitly the w-8 relation 


/ 
aes 1) rw V/A) (: A ge é - £n'). (21) 

For the stepped ruby as in Fig. 10(a) where D > D’, (e — ©€)/We 
is negative. This quantity can be calculated from Fig. 3 and it is shown 
in Fig. 11 in solid curves. Measured values are also shown as circles. 
The measurements and calculations of (e’ — ¢)/~/e agree well except 
near 6 = OQ. In practical design work, it is often convenient to keep one 
of the cutoff frequencies unchanged while shifting the other cutoff fre- 
quency by shaping the dielectric. This requires a large ratio of 


| (e’ — €)/Vel 


at the two cutoff frequencies. In Fig. 11, the curves for D/W = 0.95 
and 0.4 S$ D’/W S 0.6 satisfy this requirement. As a special case of 
stepped ruby, the value D’ = 0 may also be considered. This corresponds 
to a ruby loading of rectangular cross section which does not cover the 
full finger height h. Then ¢’ becomes unity and the value (1 — e)/~/e 
is also plotted in Fig. 11 in the top curve marked D’/W = 0.0. Since the 
ratio (1 — «)/~/e at both cutoff frequencies, that is at 6 = 0 and at 
6 = zr, is not large, the reduction of the ruby height does not appear a 
promising way to narrow the bandwidth of the comb structure. 

For K34 loading, (e’ — «)/+/e becomes positive and can be evaluated 
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Fig. 11 —| vs 6/m for ruby shaping and K34 loading. 





Ve 
from measured data with the help of (21). Two of the measured curves 
of (e& — €)/+~/e are shown in Fig. 11 in broken lines. (e& — ©)/+/é is 
larger near 0 = x than near 6 = 0. For the forward-wave structure this 
causes the upper cutoff frequency to decrease while leaving the lower 
frequency almost unchanged. 

The upper cutoff frequency of the forward-wave structure can also be 
reduced by using thin rectangular fingers (1.e.,d@ < L — 1). e(@) of the 
comb structure with fingers of square cross section was shown in Fig. 3. 
It was also shown that D/W = 0.9 --- 0.95 is usually the best choice to 
reduce the bandwidth of the passband and yet avoid fold-over near 
6 = 0. For this value of D/W, one notices in Fig. 3 that «(0) > (7). 
In order to reduce the bandwidth further, one may increase e(7) so that 
it approaches ¢«(0). This can be done by reducing the dimension d of the 
fingers. Since the fields between the fingers are almost negligible at 
6 = 0, the thickness dimension of the fingers does not affect «(0). On 
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the other hand, near 6 = 7z the fields see more of the ruby with thin 
fingers than with square fingers; thereby e(7) increases. 

In Fig. 12, ~We(x) vs d/(L — 1) using the expression given by Harris 
et al.’ is shown. Three measured points are also indicated. The expression 
of e(a) in Ref. 7 assumes a uniform field between the fingers. This ap- 
proximation is less justified as the fingers become thin, although it should 
be qualitatively correct even for d/(L — 1) < 0.5. 

Both the K34 loading and the choice of finger thickness affect the fre- 
quencies near 6 = 7. Band narrowing by thin fingers has the additional 
advantage that the filling factor improves somewhat compared to the 
use of J(34 loading and of square fingers. 

All the three techniques described here can be combined to narrow the 
passband very effectively in such a way that fold-over is still avoided. 


VII. HIGHER-ORDER MODES 


It has been shown in Section II that there exists a series of higher-order 
passbands for a comb structure. 

Let us compare the first and the second modes of operation from two 
different approaches. Equation (6) shows that for a given 6 the free-space 
wavelength of the first mode is three times longer than that of the second 
mode when all of the dimensions of the structure are kept the same. Thus 
the percentage bandwidth of the passbands is approximately the same 
for all modes. By using a frequency scale for the first mode which is 




















Fig. 12 — The square root of the effective dielectric constant at 6 = x for ‘“‘thin 
fingers.’’ It is assumed that e = 9 
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one-third that for the second mode, the w-@ diagram of the first and the 
second modes should coincide. This is indeed verified from measured w-8 
diagrams of the first and the second modes of the structure designated as 
A in Fig. 13. 

On the other hand, when W/L, D/W and s are kept unchanged while 
the length of the finger h is made three times longer, the passband of the 
smaller structure operating in its first mode will be in about the same 
frequency range as that of the larger structure operating in its second 
mode. However, the bandwidth of the smaller structure in the first mode 
is larger than that of the larger structure in the second mode. This is due 
to a larger Ah/h for the structure with the smaller h. A structure B with 
a finger length of one-third that of structure A was built, and the w-@ 
diagram of its first mode is also shown in Fig. 13. One notices that the 
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Fig. 13 — Comparisons of the first and the second passbands of the loaded 
sony structures. The fingers are 0,040 x 0,040 inch in cross section and / = 0,040 
mech, 
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first mode of B has a larger bandwidth than the second mode of A. The 
larger slowing factor associated with the smaller bandwidth is one of the 
advantages of higher-order modes, but it is not an essential factor, since 
there are more efficient techniques of reducing the available bandwidth. 
However, the higher-order modes may be advantageous when the re- 
quired microwave frequency of the passband is so high that it becomes 
difficult to fabricate comb fingers of a quarter-wave electrical length, 
provided the waveguide modes which may propagate in such a struc- 
ture can be suppressed. Then the larger structure would permit reduced 
mechanical tolerances. 


VIII. DESIGN CONSIDERATIONS 


Since the gain of the maser increases as the bandwidth of the passband 
of the structure is reduced, it is preferable to design a structure having 
the narrowest bandwidth compatible with the requirements on the useful 
bandwidth of the maser. 

The choice of various dimensions of the structure to obtain a given 
center frequency and to approach the narrowest bandwidth possible 
without fold-over will be discussed here. 

From the curves in Fig. 8, a small value of W/Z is preferred. Other 
design considerations may dictate the smallest permissible W/L value. 
In all our experiments, for example, a period length Z = 0.080 inch was 
chosen. For W/L smaller than 0.7 it would seem rather difficult to in- 
corporate a high-performance isolator into the structure. Thus W/L = 0.7 
is a compromise optimum value. 

With given dimensions of the empty comb and without ruby shaping 
near the finger tip, the bandwidth is reduced by gradually reducing D/W 
until fold-over sets in near 6 = 0. This happens at about D/W ~ 0.9. 
Yold-over appears more readily if additional “ruby shaping”’ is applied 
to the structure. Thus D/W should be larger than 0.9 in order to allow 
for some latitude in ruby shaping. From our experience D/W = 0.95 is 
a suitable value. 

For the step in the ruby near the finger tip (Fig. 10a), D’/W may be 
taken between 0.4 and 0.6. Then | (e’ — €)/+/e| at 9 = 0 is large, and 
the ratio of its magnitude at 6 = 0 and @ = 7 is also large. 

The frequency near the @ = zw end can be easily controlled by using 
thin fingers. The curve in Fig. 12 may serve as a guide in the choice of a 
suitable finger aspect ratio d/(L — 1). 

The final parameter yet to be determined is the length of the fingers, 
h. For an initial design, one may use the midband value of ~/e(2/2) 
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from Fig, 3. Then 


aN 
"STV ela/D) 
where \ is the free-space wavelength of the given center frequency. 
The three techniques discussed above for narrowing the bandwidth affect 
primarily the frequencies near 6 = 0 and @ = 7, i.e., close to both cut- 
offs; but the frequencies near 6 = 7/2, 1.e., close to midband, remain 
nearly unchanged. 

Following the suggestions given here, one should arrive at a first-order 
design for a TWM which will perform fairly close to the theoretical ex- 
pectation. Experience has shown that a small additional amount of fine 
adjustment is needed in order to have the traveling-wave maser perform 
according to the specifications. This may involve control of the center 
frequency, adjustment for more or less slowing in order to obtain the 
design gain, or adjustment of the curvature in the w-8 diagram so as to 
realize a flat gain-versus-frequency characteristic. The theoretical data 
provided in this paper make it rather easy to determine the appropriate 
design modifications. 
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APPENDIX A 


The Potential Distribution on the z Axts 


Consider the coordinate axes shown in Fig. 14. The potential on the z 
axis for —(1/2) < z < (l/2) can be expressed as 


x 


Z 
| (m)th | (m+1)th 


Fig. 14 The coordinate system used in Appendix A. 


nfo 


il 
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V(6,2) = V.(@)g(z) + Val 8)f(z) (22) 


where V,(@) and V.(@) are those parts of the potential which are sym- 
metrical and antisymmetrical with respect to the « axis, respectively, 
and g(z) and f(z) describe their variation along the z axis. They are 


defined so that 
l l 
_— <—s ais = 2 
o{-5)=o(3) = oy 


ze 


Substituting (23) and (24) into (22), the potentials of the mth and 
(m + 1)th finger become, respectively, 


Ae ™ = V,—V, (25) 
Ae ity? = Ve + Va. (26) 
From (25) and (26), 
Ve = Ac i"*? cog (6/2) (27) 
Va = —jAg "sin (6/2). (28) 


Then (22) becomes 
V(02) = Ae*"*"(g(z) cos (0/2) — jf(z) sin (0/2)]. (29) 


Assuming the fingers to be infinitely thin, g(z) and f(z) can be found 
from a Schwarz-Christoffel transformation. This transformation is de- 
scribed in Ref. 7. The result can be given in closed form for the derivatives 
g’ = dg/dz and f’ = df/dz: 





cos oF 
he a™ Tv oe ee, 
rer sao aW a (30) 
L sin? ay 0.5 F + In V2 
f' — a ee ~ 
L 4/ sine 2 _ 5 1854 (31) 
21 


where it is assumed that //LZ = 0.5. In deriving (30), an elliptic function 
has been approximated by a sine function. This approximation becomes 
better with larger W/L. The origin of the z coordinate for (80) and (31) 
is the same as originally indicated in lig. 1, which differs from that shown 
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in ig. 14. It is seen that an infinity occurs both for g’ and f’ at the finger 
corners, 2 = 1/2. 


APPENDIX B 


Impedance of a Finger Based on the “Thin-Tape” Approximation 


It was shown in Section IIT that the admittance of a finger can be 
found by adding the admittance between the fingers Y; , and the admit- 
tance between the finger and the waveguide Y,. Y2 can be expressed in 
terms of the potentials f and g along the z axis of Fig. 14. f and g are the 
potentials at 6 = a and @ = O, respectively. We shall assume f and g 
to be those obtained by conformal mapping of the thin-tape geometry. 

By separating the term with n = 0 from the remaining summation, 
(13) may be rewritten in the following form 





28 
sin — 1/2 
¥2(8) = 4 coth (« ¥) x 2 jsin§ if cos og dz 
1/2 


Yo 6 L L L 
4 
+¢ ee [Og sin ve dz\+ 3 ep ‘ 
Bee) —1 {2 J TE n=—0 (32) 
nxX~0 
. 0 nT 
sin (7 + nn) 1/2 
an ae ae (sin ‘) Z Pe cos (6 + 2nr) L@ 
4 2 
for 
L—-l=le=d 
where 


r=0 for n>QO 
r=1 for n<0O. 


The terms involving g are omitted except for n = 0, since usually g’ « f’. 
The coordinate origin used in this equation is at the position shown in 
Fig. 14. 

Let Yz, Ya and Yu be Y2(6)/Yo at 6 = 0, r/2 and 7a, respectively. 
These quantities can be obtained directly by conformal mapping. Y, 
and Yy are shown in Ref. 7. Y x, is obtained by a similar procedure and 
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Fig. 15 — The impedance K (6) of a finger at 6 = 7/2. 


the result is shown in Fig. 15. For 6 other than these three values, (32) 

should in principle be used to find Y2(@). Fortunately, Y2(@) can be ex- 

pressed approximately in terms of Y; and Y» or Yy in an interpolation 

formula, and thus the tedious evaluation of (32) can be avoided. 
Letting 6 = 0 and then 6 = 7/2 in (82), one obtains, 





1 1/2 ; 9 1/2 ; 
Y, = wl bind dz + ae 2g ae| (33) 
sin = 
= 8 : awW OF V2 [ 7 _ We 
Yu = = coth OA (sin yet fd COs 5F de 
8 
1/2 , x2 Dy) c) <i 
+ [9 sin 3 a | +72 ey (34) 
n¥0 
_ (wr, nT 
sin G + “= ) 1/2 
pA a Bere cd 
z=) ne sin 7 lat cos (§ +2nr) 7, 
8 2 


We shall assume that approximately 


1/2 1/2 
f dz = 


—1f2 —1]2 


f' cos : dz (35) 
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sin (? + nn) sin (2 + me) 
so ees INE Ee forn #0 (36) 











8 nt T ni 
ri st 2 
— z 1 / T zZ 
/ is : 7 z 
eae cos (6 + 2nr) r dz = Ee cos ( + 2nn) 7 (a 
forn # 0 
1/2 1/2 
Pi ots dz vo 2 | ] 
Le ec ama fe es (38) 
Then (32) becomes 
sin = sin l sin = 
Pe” 4 W ” 8 Ww \2W 
Y.(0) = sae Yu t+ ; coth 6 —- 7 coth SF a 
sin — = ia 
4 4 8 
(39) 
sin d 
- 6 4 W 6 _ 6 4 ie : 
sin 5 Y, + ; (coin WV (coss sin ‘) Tiles g zdz. 
4 


The last term of (389) is small compared to the other terms, and it can 
be found by numerical integration of g’ in (30). 

One notices that Y2(@) of (89) becomes Y; and Yy when 6 = 0 and 
6 = 1/2, respectively. One may estimate the error involved in (39) by 
letting 6 = aw and compare it with Yy obtained by conformal mapping. 
For W/L = 1, Yo(7) obtained from (39) gives a value 10 per cent smaller 
than that obtained directly by conformal mapping. The admittance of a 
finger which is the sum of Y; and Y2 thus has an error of about 4 per cent 
at 9 = x. For other values of 6, the error would be even smaller. 


REFERENCES 


1. DeGrasse, R. W., Schulz-DuBois, E. O., and Scovil, H. E. D., The Three-Level 
Solid-State Traveling-Wave Maser, B.8.T.J., 38, Mar., 1959, pp. 305-334. 

2. Fletcher, R. C., A Broadband Interdigital Circuit for Use in Traveling-Wave 
Tube Amplifiers, Proc. I.R.E., 40, Aug., 1952, pp. 951-958. 

3. Butcher, P. N., The Coupling Impedance of Tape Structures, J. IEE, 104B, 
Mar., 1957, pp. 177-187. 

4, Walling, J., Interdigital and Other Slow Wave Structures, J. Electronics, 3, 
Mar., 1957, pp. 239-258. 

5. Ash, E. A., and Studd, A..C., A Ladder Structure for Millimeter Waves, I.R.E. 
Trans. Electron Devices, ED-8, July, 1961, pp. 294-302. 


1066 THE BELL SYSTEM TECHNICAL JOURNAL, MAY 1964 


. Tabor, W. J., A 100-Mc Broadband Ruby Traveling-Wave Maser at 5 Ge, Proc. 


IEEE, 51, ‘August, 1968, p. 1143. 


. Harris, 8. DeGrasse, R. W., and Schulz-DuBois, E. O., Cutoff Frequencies 


of the Dielectrically Loaded Comb Structure as Used in Traveling-Wave 
Masers, B.S.T.J., 43, Jan., 1964, p. 487. 


. Haddad, G. I, and. Rowe, J. E., S Baad Ladder-Line Traveling-Wave Maser, 


LR.E. Trans. on Microwave Theory and Techniques, MTT-10, January 
1962, pp. 3-8. 


. Kostelnik, J. J., DeGrasse, R. W., and Scovil, H. E. D., The Dual Channel 
10. 


2390-me Traveling- Wave "Maser, BS.T.J. , 40, July, 1961, pp. 1117-1127. 
DeGrasse, R. W., and Hensel, M. Li. o private communication. 


A Comparison of Permanent Electrical 
Connections 


By G. W. MILLS 
(Manuscript received February 10, 1964) 


A study has been completed which compares four types of permanent 
electrical connections (soldered, solderless wrapped, percussive welded, and 
resistance welded) under environmental conditions of vibration, shock, tem- 
perature extremes, corrosion, humidity, and bending. Only good-quality 
connections were included in this study, and they represented the current 
state-of-the-art for each type. Under these conditions the connections showed 
no significant degradation in their electrical characteristics as long as they 
remained mechanically secure. Differences in the four types of connections 
were therefore assessed in relation to their mechanical characteristics. Con- 
sequently, one of the more important results of the study was the recognition 
of fatigue life as the most important mechanical connection characteristic 
when comparing connections which meet the high standards of the Bell Sys- 
tem for electrical stability. Using fatigue life as a basis for comparison and 
soldered connections as a reference standard, the major conclusions with 
regard to general wiring (the connection of wires to terminals, such as sur- 
face and local cable wiring) are as follows for the conditions that existed in 
this study: 

(a) monitored percussive welded connections are superior to soldered 
connections; 

(b) over-all, solderless wrapped connections are essentially equivalent 
to soldered connections; 

(c) resistance welded connections are significantly inferior to soldered 
connections. 

Although differences were found among the types of connections, no evi- 
dence was obtained that any of the connection types are not satisfactory as 
presently used in normal Bell System applications. 


I. INTRODUCTION 


The Bell System uses many types of electrical connections. The best 
connection for a specific application is chosen on the basis of the relative 
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merits of the connections in the following general areas: 

(1) adaptability of each connection for the application under consider- 
ation, 

(2) reliability or life required from the connections under the environ- 
mental conditions in which the equipment must operate, and 

(3) relative cost of each connection. 

This study was concerned with obtaining information on the compara- 
tive reliability or life of the four main types of permanent connections 
(solderless wrapped, soldered, percussive welded, and resistance welded) 
under various environmental conditions. The probability of the occur- 
rence of substandard connections was recognized as a factor in deter- 
mining the reliability of a given type of connection. However, only good- 
quality connections of each type were compared in this study, and all 
present manufacturing standards were followed in making them. 

All four types of connections are considered adequate for the applica- 
tions in which they are presently used, and the results of this study should 
not be construed as a recommendation to change to a different type of 
connection in these applications. 

The scope of the program is illustrated by Tig. 1. It shows general 
wiring application (the connection of wires to terminals such as surface 
and local cable wiring) and the environments considered. 

Some of the environments chosen are more severe than those normally 
encountered in a central office. This was necessary to produce a measura- 
ble effect in a reasonable time and should not alter the results, because 
this was a comparison study. 


GENERAL WIRING APPLICATION 


DESCRIPTION: 
CONNECTION OF SUBASSEMBLIES 
EXAMPLE: 
LOCAL CABLE AND SURFACE 
WIRING 
CONNECTIONS COMPARED: 
1, SOLDERED ; 
2. SOLDERLESS WRAP 
3. PERCUSSIVE WELD 


4. RESISTANCE WELD 


ENVIRONMENTS 


{. VIBRATION 

2. SHOCK 

3. TEMPERATURE EXTREMES 
4. CORROSION 

5. HUMIDITY 

6. BENDING 





Fig. 1 — Test program for permanent connection comparison. 
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The end result from each of the environmental tests was the establish- 
ment of an “order of merit” for the four types of connections in that 
particular environment. Thus the connection which suffered the least 
degradation in an environment would be placed at the top of the ‘order 
of merit”’ list and the connection which showed the greatest degradation 
would be on the bottom. 

Early in the test program, it was observed that none of the test en- 
vironments caused any electrical failures and that very few of the test 
environments caused mechanical connection failures. Thus, it was con- 
cluded that the establishment of an ‘order of merit”? would have to be 
obtained from some observed mechanical degradation in destructive 
tests. Two types of tests always led to conncction failure, i.e., breakage, 
and these were the vibration and bending fatigue tests. The establish- 
ment of an ‘‘order of merit’ for these tests was very simple: the connec- 
tions which lasted the longest were the best and those which failed first 
were the poorest. . 

Since both of these tests are essentially fatigue tests, it was realized 
that the fatigue life of a connection could be a better basis of comparison 
than an ultimate-strength test, since all the test connections had met the 
rigid Bell System standards for electrical stability. Further analysis, as 
described in Section 3.1, led to the conclusion that fatigue life should be 
used as the comparison basis throughout this study for the following 
reasons: 

(1) A comparison based on fatigue life offers an absolute comparison 
scale for the types of connections studied. 

(2) Electrically stable permanent connections can be characterized by 
their fatigue life. 


Il. TEST PROCEDURES AND RESULTS 


2.1 General 


The quality and uniformity of each group of connections was deter- 
mined by destructively testing approximately half the group and thus 
establishing the strength distribution of the remaining half, or test con- 
nections. The destructively tested or control connections were generally 
selected alternately by order of manufacture, from the whole group of 
connections. The destructive strength control data are presented in the 
Appendix along with a description of the destructive strength method 
used for each type of connection. 

Because of the statistical nature of the data collected in a program of 
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this type it is necessary to present the data in the form of probability 
distributions, as shown in Fig. 2. The ends of the bar represent the 10 
per cent and 90 per cent points on the distribution and the projecting 
line between represents the 50 per cent point. 


2.2 Linvironmental Tests 


Typical examples of the four types of connections compared in this 
study are shown in Fig. 3. Since this study is basically a comparison of 
the four types of connections, every effort was made to hold all of the 
unknown parameters to an absolute minimum, and this was accomplished 
to a great extent by subjecting all of the connections to the same en- 
vironment at the same time. The connections were mounted on fixtures 
to facilitate the handling and the mounting during exposure to the vari- 
ous environments. These mountings were of two types, as shown in Fig. 
4: (a) connections with insulation, consisting of 3-inch loops of wire con- 
necting two terminals, the group thus containing forty connections, i.e., 
20 loops of wire; (b) connections without insulation, the wire having a 
90° bend near the terminal and being fastened directly to a standoff 
insulator. This last type was designed for the resistance change measure- 
ments (Al) which were required for the temperature, corrosion and 
humidity test environments. 

This study consisted of the sixteen general wiring tests listed in Table 
I. Most of these were of the comparison type, but a few were studies con- 
cerned with only one type of connection. All of the vibration fatigue 
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Fig. 2 — Typical test data. 


PERMANENT ELECTRICAL CONNECTIONS 1071 





SOLDERED SOLDERLESS WRAP 





PERCUSSIVE WELD RESISTANCE WELD 


Fig. 3 — Typical examples of general wiring connections. 


tests, with the exception of test 14, used the vibration configuration 
shown in Fig. 5. 

The first footnote of Table I requires some explanation. The vibration 
fatigue life determinations in this study were spread over a considerable 
number of months because of the long exposure times for some of the 






3-INCH LOOP 
- OF INSULATED 
WIRE 





(a) 


Fig. 4 — Typical general wiring connections: (a) with insulation, (b) without 
insulation. 
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TasLe I — List or GENERAL WIRING TESTS 


























Insulation Fatigue Method & re 
a ° 
sez | 2 
Test Description eae = oP oh gus 5 i 
3 & |oselessl gae | Ss 
a s a |SS5/585! 265 50 
3 = 2 |e) Sama! sok aSy-4 
= = > 14 x Z a 
Vibration test without insu- 1 x x 80 6 
lation 
Vibration test with insula- 2 X xX 40 6 
tion 
Shock test, laboratory (fa- 3 xX | xX 40 7 
tigue life) 
Shock test, laboratory (de- 4 x 40 8 
structive test) 
Shock test, railroad non- 5* x xX 20 9 
cushioned 
Shock test, railroad cush- 6* xX x 20 9 
ioned 
Temperature test, central i xX x 40 10 
office conditions 
Temperature test, outside 8 x x 40 10 
plant conditions 
Three months corrosion test 9 xX X 20 11 
Six months corrosion test 10* x xX 20 11 
Humidity test ll xX x 40 12 
Lightly loaded bending test | 12 xX x 40 14 
(80° angular  displace- 
ment) 
Heavily loaded bending test | 13 xX xX 40 16 
(45° angular  displace- 
ment) 
Configuration test 14 xX | X 40 17 
Ultimate strength as afune- | 15 xX x 120+ 18 
tion of fatigue life 
Inferior weld test 16 xX xX 20t 19 











* Test conducted after recalibration of the vibration machine. 
J Only percussive welded connections tested. 


tests. Toward the end of the testing, the mounting springs on the table 
of the vibration machine had to be replaced and the machine recali- 
brated. This recalibration seems to have affected the fatigue life deter- 
mination of some or all of the subsequent tests; however, it has not been 
possible to determine the extent of this difference or even to prove that 
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Fig. 5 — Standard vibration configuration. 


RESISTANCE 
WELD (R) 





PERMANENT ELECTRICAL CONNECTIONS 1073 


it does or does not exist. If a difference does exist, it would introduce an 
error in the comparisons between the vibration fatigue life data of some 
of the later tests and the control fatigue life data (test 1 or 2) but would 
not affect the comparison among the four types of connections on a 
particular test, since they all experienced the same vibration environ- 
ment. 


2.2.1 Vubration Tests 


The vibration tests have been divided into two general classifications: 
(a) those for wires without insulation and (b) those for wires with insula- 
tion. 

The results of these two tests are shown in Fig. 6, and they cover the 
fatigue life of the four types of connections, with and without insulation. 
The connections which were connected with a loop were grouped in 
pairs. They were vibrated according to the schedule listed in Table II 
and used the vibration configuration illustrated in Fig. 5. The actual 
motion of the wires was a function of the mass of the loop as well as the 
acceleration of the connection; therefore all loops were made as close to 
the same size as possible. 

In test 1 (without insulation) it was necessary to solder loops on the 
connections, since these were set up for resistance change (AR) measure- 
ments as shown in Fig. 4(b) and therefore had no loops. Care was taken 
to control the size and weight of these loops so that they would closely 
approximate the loops with insulation in test 2, shown in Fig. 4(a). 


1. SOLDERED (S) 
2.SOLDERLESS WRAP (SW) 
3. PERCUSSIVE WELD (P) 


4.RESISTANCE WELD (R) 






40 


1, PERCUSSIVE WELD (P) 
2. SOLDERED (S) 

35 3. SOLDERLESS WRAP (SW) 
4. RESISTANCE WELD (R) 





nN 
M 


HOURS TO FAILURE 
= nN 
On {o) 





Fig. 6 — Vibration fatigue life results. (a) Test number.1 (without insulation) ; 
order of merit, based on vibration fatigue life. (b) Test number 2 (with insula- 
tion); order of merit, based on vibration fatigue life. 
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TaBLE IJ — Vispration ScHEDULE 
The vibration was sinusoidal in wave shape and its frequency varied from 5 
cps to 500 cps and back to 5 cps in 2 minutes. The displacement, acceleration, 
and running time schedules were as follows: 








Displacement (Inches) Acceleration (G’s) Running Time (Hours) 
0.1 5 2 
0.2 10 2 
0.3 15 2 
0.4 20 2 
0.5 25 to destruction 








Note: The cross-over frequency was approximately 30 cps: that is, 5 to 30 
cps controlled displacement and 30 to 500 cps controlled acceleration. 


Since the connections were grouped in pairs and the weakest connection 
of the pair failed first, no usable fatigue data could be obtained from the 
remaining connection and it was clipped off, leaving a stub which could 
be destructively tested. 


2.2.2 Shock Tests 


The shock tests can be divided into two general classifications: (a) 
laboratory shock and (b) railroad shock. Two laboratory shock tests 
were conducted, one using the vibration fatigue life after the shock test 
as a measure of the connection degradation, and the other using the 
destructive strength as an indication of any degradation suffered by the 
connections. These tests consisted of subjecting all four types of con- 
nections to 90 high-level shocks. All of the shock test connections were 
mounted on a fixture and experienced the same shocks at the same time. 
Half sine wave shocks were used with a peak amplitude of 500 to 600 
G’s and a duration of 2 to 3 milliseconds. There were no connection 
failures due to this shock environment. In the case of the fatigue life 
part, test 3, the connections were then subjected to the vibration sched- 
ule (Table II) and their fatigue life determined. The results of these 
tests are presented in Fig. 7. 

In the case of the destructive strength part, test 4, the connections 
were destructively tested after the 90 shocks, each in accordance with 
the method prescribed for it, as described in the Appendix. The results 
of these tests are compared to a destructive strength control and are 
shown in Fig. 8. 

It should be noted that the order of merit for test 4 could not be based 
on any absolute strength scale because of the differences in the destruc- 
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ARROWS INDICATE THREE 
MUTUALLY PERPENDICULAR 
DIRECTIONS OF ACCELERATION 
FROM LABORATORY SHOCK 
TEST. 30 SHOCKS IN EACH 
DIRECTION, 90 TOTAL 


WEIGHT OF 

WIRE LOOP 

0.18 TO 0.20 
GRAMS 


35 


1. SOLDERED (S) 

2.SOLDERLESS WRAP (Sw) 
30 3.PERCUSSIVE WELD (P) 

4.RESISTANCE WELD (R) 


25 
2-P 
20 


HOURS TO FAILURE 





Fig. 7 — Vibration fatigue life after laboratory shock tests. (a) Typical con- 
nection orientation during shock tests. (b) Test number 3 (with insulation); 
order of merit, based on vibration fatigue life. 
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Fig. 8 — Destructive strength after laboratory shock tests (test number 4); 
order of merit, based on percentage of destructive test degradation. 
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tive strength tests. It was therefore based on the percentage degradation 
indicated by the difference between the mean of the control sample and 
the mean of the sample which had been exposed to the shock. 

The railroad shock tests were of two types: (a) noncushioned, test 5, and 
(b) cushioned, test 6. These tests consisted of sending the four types of 
connections in a rigid plywood box from Columbus, Ohio, to New York 
City by railway express for a total of ten round trips. The connections 
from both tests, numbers 5 and 6, were shipped in the same container; 
however, the noncushioned connections were fastened directly to the 
plywood box, whereas the cushioned connections were supported by 
rubberized hair in the center of the container. The fatigue life of the con- 
nections was determined after the ten round trips by vibrating them ac- 
cording to the vibration schedule (Table II). The results of both tests 
are presented in Fig. 9. There was very little difference in degradation 
observed for the two tests, and they resulted in identical orders of merit 
based on the vibration fatigue life. 

The vibration fatigue life determination for these two tests was con- 
ducted after recalibration of the vibration machine as discussed in Sec- 
tion 2.2, and this could affect the comparison between these tests and the 
control; however, it will not affect the comparison of each type of con- 
nection within tests 5 and 6. 


1. SOLDERED (S) 
2. SOLDERLESS WRAP (SW) 


3. PERCUSSIVE WELD (P) 
4. RESISTANCE WELD (R) 


HOURS TO FAILURE 





Fig. 9 — Vibration fatigue life after railroad shock and vibration. (a) Test 
number 5 (noncushioned); (b) test number 6 (cushioned). The order of merit, 
based on vibration fatigue life, was the same for both tests. 
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2.2.3 Temperature Tests 


The connections were subjected to two types of temperature tests: 
(a) central office conditions, test 7, and (b) outside plant conditions, 
test 8. 

The central office temperature test consisted of subjecting forty con- 
nections of each type to a temperature of 105°C for a total of 154 days. 
Once a week the connections were removed from the oven and allowed 
to come to room temperature (20°C). Every two weeks the connections 
were mechanically disturbed (plucked) and the change in resistance, 
AR, was measured. After the 154 days, loops were carefully soldered to 
the connections and they were subjected to vibration according to the 
vibration schedule (Table IT) and their fatigue life determined. The re- 
sults are presented in lig. 10. All forty connections of each type sur- 
vived the 154-day test without developing a change in resistance, AR, 
of 0.001 ohm, which would have constituted an electrical failure. The 
solderless wrap connections, however, did develop a considerably higher 
number of AR’s, as shown in Table III. 

The soldered, solderless wrap, and resistance welded connections 
showed a small but significant loss in fatigue life due to test 7. The 
percussive welded connections, however, showed a larger, more signifi- 
cant loss in fatigue life due to the temperature test. 

The outside plant temperature test, test 8, was very similar to the 
central office condition temperature test. The differences were that test 
8 ran for 168 days and that when the connections were removed from the 


1. SOLDERED (S) 

2. PERCUSSIVE WELD (P) 
3. SOLDERLESS WRAP (SW) 
4. RESISTANCE WELD (S) 


1-S 
- # \ 


HOURS TO FAILURE 





Fig. 10 — Vibration fatigue life after temperature tests. (a) Test number 7 
(central office conditions); (b) test number 8 (outside plant conditions). The order 
of merit, based on vibration fatigue life, was the same for both tests. 
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Tas_LE II] — Numper or AR VALUES OBSERVED FOR 
Eacu Typr oF CONNECTION 


























Soldered Solderless Wrap Percussive Weld Resistance Weld 
AR Values in 
Milliohms 
co OP co OP CO OP co OP 
0.2 6 none 42 44 1 1 3 1 
0.3 none none 8 9 none none none none 
0.4 none none 1 2 none none none none 
0.5 none none 4 1 none none none none 
0.6 none none none 1 none none none none 





Note: CO = central office conditions, OP = outside plant conditions. 


oven at 105°C they were immediately placed in a cold box at —40°C 
and allowed to stabilize at this temperature. After approximately an 
hour at —40°C, they were removed and allowed to come to room tem- 
perature (20°C). Resistance change data and vibration fatigue life in- 
formation were obtained in the same manner as the central office tem- 
perature test; the results are presented in Table III and Fig. 10. The 
changes in resistance and fatigue life data for the outside plant condi- 
tions were similar to the results obtained from the central office condi- 
tions, except that the soldered and solderless wrapped connections sus- 
tained less degradation. Both temperature tests yielded the same order 
of merit. 


2.2.4 Corrosion Tests 


The corrosion tests were of two types: (a) three months, test 9, and 
(b) six months, test 10. These tests were identical in all respects with the 
exception of the exposure time, as indicated in Fig. 11. They consisted of 
exposing all four types of connections to the corrosive atmosphere of 
New York City on the roof of the Bell Laboratories building at West 
Street. Resistance change measurements were made on all of the con- 
nections before and after exposure. All corrosion test connections sur- 
vived the environment without developing the 0.001l-ohm resistance 
change which would have constituted an electrical failure. 

The apparent increase in fatigue life of all the connections, except 
for percussive welded connections, in the six months corrosion test is 
unexplainable except for the recalibration of the vibration machine dis- 
cussed in Section 2.2. This is the most probable cause of this incon- 
sistency, since there is no reason to expect that exposure to a corrosive 
atmosphere can improve the fatigue life of any type of connection. 


PERMANENT ELECTRICAL CONNECTIONS 1079 


1. PERCUSSIVE WELD (P) . SOLDERED (S) 

2. SOLDERED (S) . PERCUSSIVE WELD (P) 
3. SOLDERLESS WRAP (Sw) . SOLDERLESS WRAP (SW) 
4. RESISTANCE WELD (R) . RESISTANCE WELD (R) 
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10-SW 
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(a) 





Fig. 11 — Vibration fatigue life and orders of merit after corrosion tests. (a) 
Test number 9 (three months); (b) test number 10 (six months). 


2.2.5 Humidity Test 


The humidity test, test 11, consisted of subjecting all four types of 
connections to controlled temperature and humidity conditions accord- 
ing to the following schedule: they were exposed to 90 per cent relative 
humidity and 85°F dry bulb temperature for six consecutive days, then 
dried at 140°F for two days. This cycle was repeated eight times for a 
total test time of 64 days. Resistance change measurements were made 
before, during, and after the test with no electrical failures being ob- 
served. The fatigue life of the connections was determined by carefully 
soldering loops of wire to the connections and vibrating them according 
to the vibration schedule (Table I1). The results are presented in Fig. 12. 


1. SOLDERED (S) 

2. PERCUSSIVE WELD (P) 
3. SOLDERLESS WRAP (SW) 
4. RESISTANCE WELD (R) 


HOURS TO FAILURE 





Fig. 12 — Vibration fatigue life and order of merit after humidity test, test 
number 11. 
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2.2.6 Bending Tests 


All four types of connections were subjected to two bending tests: 
(a) the lightly loaded bending test, test 12, and (b) the heavily loaded 
bending test, test 13. 

Bending, in the lightly loaded bending test, took place in a horizontal 
plane as shown in Fig. 13. The load on the connections came from the 
tension in the wire and varied from approximately 0 to 4 grams. The wire 
was moved 30° in one direction from its equilibrium position and then 
returned, and then moved 30° in the other direction and returned to its 
equilibrium position. This constituted one cycle. The number of such 
cycles to failure for each connection was the fatigue data. Forty con- 
nections of each type were tested and the results are presented in Tig. 14. 

The heavily loaded bending test consisted of determining the bending 
fatigue life of all four types of connections by bending in a vertical plane 
with 300 grams hanging on the connection, as shown in Fig. 15. At a 
relatively slow rate the terminal was rotated 45° from its originally hori- 
zontal position and then returned. This constituted one cycle. The 
number of such cycles required to cause failure of each connection was the 
fatigue data. Forty connections of each type were tested for all connec- 
tions except solderless wrap, of which 32 were tested; results are presented 
in Fig. 16. 


2.2.7 Additional Tests 


There were three additional tests conducted in this study; (a) con- 
figuration test, test 14, (b) ultimate strength as a function of bending 
fatigue life for percussive welded connections, test 15, and (c) inferior 
weld test, test 16. 

The configuration test consisted of determining the vibration fatigue 
life of the four types of connections, using a different configuration. The 
90° bend used in the configuration shown in Fig. 5 was eliminated and the 
wire was brought straight from the terminal as shown in Fig. 17(b). Vi- 


th. 


SOLDERED (S) SOLDERLESS WRAP PERCUSSIVE WELD RESISTANCE WELD 
( (P) (R) 





Fig. 13 — Lightly loaded bending configurations. 
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Fig. 14 — Lightly loaded bending fatigue life, test number 12 (with insulation); 
order of merit, based on bending fatigue life. 
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Fig. 15 — Heavily loaded bending configurations. 
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Fig. 16 — Heavily loaded bending fatigue life, test number 13 (with insula- 
tion); order of merit, based on bending fatigue life. 


1082 THE BELL SYSTEM TECHNICAL JOURNAL, MAY 1964 


1. SOLDERLESS WRAP (SW) 
2.PERCUSSIVE WELD (P) 
3. SOLDERED (S) 
4.RESISTANCE WELD (R) 


HOURS TO FAILURE 





GP MiP GP 


SOLDERED (Ss) SOLODERLESS PERCUSSIVE RESISTANCE 
WRAP (sw) WELD (P) WELD (R) 


(b) 


Fig. 17 — (a) Vibration fatigue life with configuration omitting 90° bend in the 
lead-off wire, test number 14 (with insulation); order of merit, based on vibra- 
tion fatigue life. (6) Vibration configuration omitting 90° bend. 


bration still took place in a vertical plane and the vibration schedule 
(Table IT) was followed. The results of this test are also shown in Fig. 
17(a). All four types of connections showed a deterioration in their fa- 
tigue life when the 90° bend was omitted. This bend apparently partially 
isolates the connections from external wire movements and therefore 
improves their fatigue life. 

The objective of test 15 was to determine the ultimate strength of 
percussive welded connections as a function of the bending fatigue life. 
The fatigue method selected was the lightly loaded bending type. A 
total of 240 percussive welded connections were used in this test; they 
were divided into two groups of 120 each. The first group was destruc- 
tively tested immediately after manufacture to determine their strength, 
and these data are included in the destructive strength control presented 
in the Appendix. The remaining 120 connections were divided into eleven 
groups: one group of 20 connections and 10 groups of 10 connections 


PERMANENT ELECTRICAL CONNECTIONS 1083 


TaBLE [V — ScHEDULE FoR Test 15 





Number of lightly | 500} 600} 700) 800} 900; 1000; 1100; 1200) 1300) 1400 
loaded bending cy- 
cles 

Surviving connections 10; +10 9} 10 9 9 7 3 6 2 

Average strength of | 11.1) 11.1) 11.1) 11.2/ 10.8) 10.9} 11.1} 10.7) 10.5] 10.7 
surviving connec- 
tions in pounds 














each. The 20-connection group was fatigued to failure by the lightly 
loaded bending method to establish the fatigue life of the connections. 
The other ten groups were each fatigued to a predetermined number of 
bending cycles according to the schedule shown in Table IV. After the 
connections in each group had been fatigued for the number of cycles 
assigned to that group, the connections remaining were destructively 
tested using the combined test. 

The results of this test, presented in Fig. 18, indicate that at 100 per 
cent of the average fatigue life the surviving connections show only an 
approximate 5 per cent reduction in their ultimate strength. This indi- 
cates that the mechanical quality of a percussive welded connection is 
characterized better by its fatigue life than by its ultimate strength. 

The object of the inferior weld test, test 16, was to determine the ulti- 
mate strength and fatigue life of inferior percussive welds and to com- 
pare these characteristics with those of good welds. The welds were made 
inferior by using insufficient capacitance in the welding power supply 
during their manufacture. The results are presented in Fig. 19. The 


— 





PER CENT OF AVERAGE WIRE STRENGTH 





PER CENT OF AVERAGE FATIGUE LIFE 


Fig. 18 — Ultimate strength as a function of bending fatigue life for percussive 
welded connections, test number 15 (without insulation). 
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Fig. 19 — Inferior weld test, test number 16: (a) destructive strength — in- 
ferior welds show 55 per cent of the strength of good welds, based on the means 
of the distributions; (b) bending fatigue life (without insulation, lightly loaded 
bending) — inferior welds show 17 per cent of the bending fatigue life of good 
connections, based on the means of the distributions. 


combined destructive test was used to determine the ultimate strength of 
the connections, and the fatigue method selected was of the lightly loaded 
bending type. 

The welds used in this test showed only 55 per cent of the strength of 
good connections, and the fatigue life of the inferior type welds showed 
17 per cent of the fatigue life of good connections. These results indicate 
that, for percussive welding, bending fatigue life is a much more sensitive 
indicator of weld quality than the strength of the connection as measured 
by the destructive combined test. 


III. DISCUSSION 


3.1 Importance of Fatigue 


The measure of life expectancy chosen for this study was the fatigue 
life of the connections. The effect of the various nondestructive environ- 
ments was measured by determining the loss of fatigue life caused by 
them. During testing, no significant electrical degradation was observed 
on any of the test connections as long as they remained mechanically 
secure. Comparison of the four types of connections was therefore made 
on the basis of their mechanical characteristics. Fatigue life was found 
to be the most important mechanical characteristic of permanent elec- 
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trical connections, as well as being one of the most sensitive indicators 
of connection degradation. 

Fatigue life was measured in both vibration and bending. The vibration 
fatigue life was measured by the number of hours the connections sur- 
vived a specified vibration schedule, and the bending fatigue life was 
measured by the number of cycles to failure of the connections in the 
lightly loaded and heavily loaded bending configurations. A comparison 
based on fatigue life offers an absolute comparison scale for all types of 
connections and facilitates the determination of an order of merit. 

A given connection is better than another if, under identical environ- 
mental conditions, its life is longer. The life of a connection has ended 
when for any reason it fails to provide an adequate path for electrical 
current. In general, a connection may fail in two ways, electrically or 
mechanically. An electrical failure occurs when the connection develops 
an electrical characteristic such as a large constant or variable resistance 
which is incompatible with the equipment in which it is used. This type 
of failure is generally important only for pressure type connections; 
with properly designed and applied connections of this type it poses no 
serious threat to long life. 

A mechanical connection failure occurs when the conducting path is 
physically broken. Most of the failures that are found in normal use fall 
into this group, and these are very small in number. Mechanical failures 
can be further divided into two groups, excessive force failures and fatigue 
failures. An excessive force failure is defined as a failure due to the appli- 
cation of a destructive force greater than that which a new, average, good 
connection can withstand. The probability of such a failure for good 
connections is directly related to the care with which the connections 
are handled and is generally very low. 

A fatigue failure is defined as any mechanical failure in which the re- 
duction in fatigue life was the primary cause of failure. If a connection 
is repeatedly stressed to a value below its breaking point, by definition 
it is being fatigued, and its time of ultimate failure will have been signifi- 
cantly influenced by its fatigue history. Thus most mechanical connec- 
tion failures are due to fatigue, and since most connection failures are 
mechanical, it is apparent that most of the connections which fail in 
service do so because of fatigue. 

The basic measure of quality of a permanent connection is its life. 
Any adverse environment to which the connection is subjected usually 
results in a reduction in either its electrical or mechanical life. There is 
no environment present in a typical central office which will cause a 
significant reduction in the electrical life of connections of the types and 
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qualities considered in this study so long as the connection remains 
mechanically secure. Two of the most severe environments in a central 
office are the small-amplitude vibrations due to equipment operation and 
the occasional bending of the connection during testing and wiring 
changes. These are both of a fatigue nature, and if the connection pre- 
maturely fails, it will probably be due to fatigue. Other environments 
such as temperature, corrosion, and humidity will in general not cause 
connection failure but will reduce the fatigue life, as shown in this study, 
and thus hasten the failure of connections. It follows, therefore, that 
most connection failures in service will be fatigue failures and that the 
basic measure of mechanical quality of a permanent connection is its 
fatigue life. 

The results of this test program offer a number of arguments support- 
ing the hypothesis that fatigue life is the most important mechanical 
characteristic of a good connection. These will be presented in the form 
of statements and then the supporting data and reasoning will be dis- 
cussed. 

(1) The ultimate mechanical strength of a connection is a poor meas- 
ure of the fatigue life remaining in the connection. 

Test 15 (Fig. 18) illustrates the above statement for percussive welds. 
In this test, connections were fatigued a predetermined number of cycles 
by the lightly loaded bending method and then the surviving connections 
were destructively tested. The results show that, at 100 per cent of the 
average fatigue life, the average strength of the surviving connections 
was approximately 95 per cent. In other words, when almost all of the 
fatigue life of the connection was expended, it showed only a 5 per cent 
loss in strength. The surviving connections were actually in bad shape, 
but a destructive strength test would have indicated hardly any degra- 
dation. 

A second statement, closely associated with the first, is as follows: 

(2) Loss in the ultimate mechanical strength of a connection is gener- 
ally coincident with an even greater loss in its fatigue life. 

Test 15 supports this statement. Further support is provided by the 
inferior weld test, Fig. 19. Inferior percussive welds were manufactured 
on purpose by using insufficient capacitance in the welding power sup- 
ply, and the destructive strength and bending fatigue life distributions 
determined for these inferior welds. The results show that the destruc- 
tive strength average dropped to 55 per cent of the good weld value and 
the bending fatigue average dropped to 17 per cent of the good weld 
value. Thus the fatigue life is approximately three times as sensitive an 
indication of inferior welds as destructive strength tests. 
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3.2 General 


The most important factor affecting the fatigue life of a permanent 
connection is the manner in which the wire is brought from the terminal 
or its configuration. The configuration test, test 14, showed that when the 
90° bend in the standard configuration was omitted, the fatigue life of all 
four types of connections was drastically reduced. The configuration 
used in test 14 may or may not have been the worst possible fatigue life 
configuration for the connections, but there can be no doubt that it is 
possible for the configuration to alter the fatigue life by approximately 
a factor of two. The results of the configuration test lead to the conclu- 
sion that the 90° bend in the standard configuration apparently partially 
isolates the connection from external wire movements and therefore 
improves the fatigue life of connections. 

In general, the results of the shock tests indicate that all of the con- 
nections suffered some mechanical degradation from this environment. 

The laboratory shock test (fatigue life) and both railroad shock tests 
show that in general it is the high fatigue life connections, i.e., those on 
the high end of the distribution, which are most affected by the shock 
environment. 

Laboratory shock tests 3 and 4 afford one of the few opportunities for 
a direct comparison of the fatigue life and destructive strength test 
methods of measuring the degradation of the connections. The connec- 

tions of both tests were exposed to the same shock environment and the 

degradation due to this environment was measured in two ways, (1) 
vibration fatigue life (Fig. 7) and (2) a destructive test (Fig. 8). Table 
V summarizes the results of these two tests and presents the degradation 
as a percentage of the control value. 

Inspection of Table V indicates that for all four types of connections 
fatigue life is the more sensitive indicator for the measurement of shock 
degradation. This table thus further substantiates the statement that the 


TABLE V—Suock DEGRADATION AS MEASURED BY FATIGUE LIFE 
AND DeEstTRUCTIVE TESTS (PERCENTAGE DEGRADATION DUE 





TO SHOCK) 

Connection Type Fatigue Life Destructive Test 
Soldered 23% 15% 
Solderless wrap 24% 10% 
Percussive weld 23% 1% 


Resistance weld 22% 8%, 
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mechanical strength of a connection is a poor measure of the fatigue life 
remaining in the connection. 

The results of the central office and outside plant temperature tests, 
7 and 8 respectively, showed, in general, a small but significant loss in 
fatigue life for all types of connections except for percussive welded 
connections. The degradation experienced by percussive welded connec- 
tions from both temperature tests was greater than that of the other 
three types and amounted to approximately 30 per cent of the control 
fatigue life. Even this significant loss, however, did not prevent percus- 
sive welded connections from placing second in the order of merit. 


3.2.1 Composite Vibration Fatigue Life 


The vibration fatigue life data from all of the tests without insulation 
have been added together to form a composite or summary for each type 
connection. This summary is presented in Fig. 20 and Table VI; the 
table contains a list of the tests and the number of connections of each 
type which were included. The order of merit based on the vibration 
fatigue life for the connections without insulation is as follows: 

1. percussive weld 
2. soldered 

3. solderless wrap 
4, resistance weld. 

A similar summary for the connections with insulation is presented 
in Fig. 21 and Table VII. It should be noted that the data scatter, as 
indicated by the length of the bar in the soldered and solderless wrap 
distributions, has increased, compared to the distribution without in- 
sulation, whereas the scatter of percussive and resistance welded data 


25 


1, PERCUSSIVE WELD (P) 
2. SOLDERED (S) 

20 3. SOLDERLESS WRAP (SW) 
4.RESISTANCE WELD (R) 


HOURS TO FAILURE 





Fig. 20 — General wiring vibration fatigue life summary (without insulation); 
order of merit, based on vibration fatigue life. 
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Taste VI — Visration Test Data SumMARY (wiTHOUT INSULATION) 








Number of Data Points From Each Test 




















Test Description Test Number 
Salacced ae ii ae Resistance 

Vibration J 40 40 37 40 
Temperature 

central office 7 20 20 20 20 

outside plant 8 20 20 20 20 
Corrosion 

3 months 9 10 10 10 10 

6 months 10 10 10 10 10 
Humidity 11 20 20 20 20 
Totals 120 120 117 120 











1. SOLDERLESS WRAP (SW) 
2. SOLDERED (S) 
3.PERCUSSIVE WELD (P) 
4.RESISTANCE WELD (R) 


HOURS TO FAILURE 





Fig. 21 — General wiring fatigue life summary (with insulation); order of 
merit, based on vibration fatigue life. 


TaBLe VII — Visration Test Data SuMMARY (WITH INSULATION) 











Number of Data Points From Each Test 
Test Description Test Number 
Soldared —— eve Bean 
Vibration 2 20 20 20 20 
Configuration 14 20 20 20 20 
Shock 
Laboratory 3 14 20 18 20 
Railroad 
Noncushioned 5 10 10 10 10 
Cushioned 6 10 10 10 10 


Totals | 74 80 78 80 
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has remained about the same. This is probably due to the insulation 
effect on the soldered and solderless wrapped connections. These con- 
nections were manufactured with the insulation in close proximity to 
the terminal, which is the normal wiring procedure; however, the exact 
location of the insulation was not accurately controlled and this resulted 
in a somewhat random insulation effect. 

The order of merit based on the vibration fatigue life of the connections 
with the insulation is as follows: 

1. solderless wrap 
2. soldered 

3. percussive weld 
4. resistance weld. 

It is the opinion of the author that the “without insulation” data is 
more important than the ‘with insulation” data, because the presence 
of insulation close to the connection cannot be depended upon in the case 
of soldered and solderless wrap connections. The worst case, without in- 
sulation, is therefore the most important. Inspection of the bar graphs in 
Fig. 20 and 21 shows that there is very little difference in the low end of 
the soldered and solderless wrap distributions, which indicates that the 
low fatigue life connections in the “with insulation” distribution were 
essentially without insulation. Since it is the low fatigue life connections 
which are most vulnerable to early failure, it follows that the most mean- 
ingful comparison should be based on the “without insulation” data. 

If the presence of insulation could be assured, as in the case of the 
modified solderless wrap which has one turn of insulated wire as part of 
the connection, the fatigue life would undoubtedly be consistently higher 
and the ‘‘with insulation” data would be more meaningful. 

The above two summaries were obtained by adding the fatigue life 
data of the tests involved to form a composite distribution. This pro- 
cedure weights the final distribution according to the number of connec- 
tions in each type of test, and this is certainly not the only way to treat 
the data. However, all four types of connections were treated equally, 
and so a comparison between them is meaningful. 


3.2.2 Orders of Merit 


An order of merit based on vibrational fatigue life statistical distribu- 
tions was obtained from all environmental tests which compared the 
four types of connections and which used vibrational fatigue as a meas- 
ure of degradation. It is also possible to obtain an order of merit for these 
tests from the number of stub connections of each type which survived 
the complete vibrational fatigue life determination. When one connec- 
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tion of a pair failed during the vibration test, the loop was clipped off, 
leaving the remaining good connection with a wire stub long enough to be 
destructively tested. Then the vibration test was continued until all 
pairs of connections had at least one failure. This additional vibration 
caused some of the stubs from the surviving connections to fail. Since all 
four types of connections initially had the same number of test connec- 
tions, an order of merit can be obtained from the number of surviving 
stub connections by assigning the first position to the connection type 
which had the highest number of surviving stubs, second place to the 
type having the next highest number, and so forth. This procedure has 
meaning because all stub connections remained on the vibration machine 
until the end of the vibration period. Table VIII compares the orders of 
merit obtained from the number of stubs remaining with those obtained 
by consideration of the fatigue life distribution of the four types of con- 
nections. 

In combining the orders of merit in Table VIII, it seems reasonable to 
assume that the soldered and percussive welded connections are of ap- 
proximately equal merit and should share the number 1 position in an 
over-all vibrational order of merit as follows: 

{soldered 

percussive weld 
3. solderless wrap 
4. resistance weld. 

It seems desirable to establish an over-all order of merit, shown in 
Table IX, for the four types of connection considered in the general 
Wiring portion of this study. It is possible to do this because approxi- 
mately equal numbers of all four types of connections were subjected to 
the various environments at the same time and under identical condi- 
tions. It should be remembered, however, that the best connection in the 


Taste VIII — Orver or Merit Comparison 








Order of Merit 





Based on Number of Surviving 


Stubs Based on Fatigue Life 


First | Second| Third | Fourth] First | Second | Third | Fourth 




















Percussive weld (P) 6 2 3 0 2 6 3 0 
Soldered (8) 4 6 J 0 8 2 1 0 
Solderless wrap (SW) 1 3 7 0 1 3 7 0 
Resistance weld (R) 0 0 0 11 0 0 0 11 








Numbers represent the number of firsts, seconds, thirds, and fourths for 
each rating method. 
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Tasue IX — Over-ALL ORDER oF MEeEnRritT 





Order of Merit 








Description Pena 
First Second Third Fourth 
Vibrational fatigue 1600 S&P — SW R 
Lightly loaded bending fatigue 160 P SW Ss R 
Heavily loaded bending fatigue 152 SW P Ss R 
Over-all order of merit based on 1912* P S) SW R 


fatigue 


Legend: 8, soldered; SW, solderless wrap; P, percussive welded; R, resistance 
welded. 
* Sum of above three groups of connections. 


over-all order of merit was not necessarily the best under every environ- 
mental test, but was best only in an average sense. 

Percussive welded connections were assigned first place because they 
shared first place with the soldered in the most important group of 
tests, vibrational fatigue (1600 test connections), and were also first 
in the lightly loaded bending tests. Soldered connections were assigned 
second place on the basis of their sharing first place in the vibrational 
fatigue tests. Resistance welded connections were assigned last place 
for obvious reasons, leaving third place for solderless wrapped connec- 
tions. 

It should be remembered that the soldered and solderless wrapped con- 
nections used in this study were of the same quality as those in wide- 
scale use in the telephone plant today. These connections have given 
and continue to give satisfactory service in the central office environ- 
ment. 

On the other hand, the percussive welded connections used in this 
study were of higher quality than those which have been used in special 
applications. This higher quality results from the use of the monitoring 
technique described in Ref. 1. These monitoring criteria are strongly 
recommended for all applications of percussive welding to assure the 
high quality of which the process is capable. 


3.2.3 Summary 


Table X, connection suitability as a function of environment (based 
on fatigue life), attempts to present the results of the general wiring 
portion of this study in a manner which would aid in the selection of a 
connection for a given environment. A rating number, based on the 
fatigue life of the connections, is assigned to each type of connection for 
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‘TABLE X — CONNECTION SUITABILITY AS A FUNCTION OF ENVIRONMENT 
(BASED ON FatiqguE Lirs) 
























































Without Insulation With Insulation 
Test Description Test Fatigue | Sol- is . so]- ” . 
° No. | Methed*| 1 | Ser | Cas Resis-| sol. fee ane: Reis 
dered | Jess | give | Weld | dered | less | ive | Weta 
Vibration test 1&2 | VIB 7 5 8 4 10 8 8 4 
Shock test labora- 3 VIB 8 7 6 3 
tory 
Shock test railroad 5 VIB 9 7 6 3 
noncushioned 
Shock test railroad 6 VIB 9 7 7 3 
cushioned 
Temperature _ test 7 | VIB 6 4 6 3 
central office con- 
ditions 
Temperature — test 8 | VIB 7 5 6 3 
outside plant con- 
ditions 
Three months cor- 11 VIB 7 5 8 4 
rosion 
Six months corrosion] 12 VIB 7 5 8 4 
Humidity tests 13 | VIB 7 5 rs 3 
Configuration test 14 | VIB 4 6 4 2 
Composite — | VIB 7 5 8 4 7 7 6 | 3 
Lightly loaded | 12 | LLB 3 5 | 10 2 
bending test 
Heavily loaded | 13 | HLB 8 | 10 8 5 
bending test 

















* Fatigue method: VIB — vibration; LLB — lightly loaded bending; HLB — 
heavily loaded bending. 


each test conducted. The table is divided into three general areas defined 
by the double lines. The top area is concerned with the vibration fatigue 
life both with and without insulation, the middle area is concerned with 
the lightly loaded bending fatigue method, and the last area at the bot- 
tom is concerned with the heavily loaded bending fatigue method. The 
rating numbers within any of the three areas are consistent among 
themselves. However, comparison of rating numbers from different 
areas has no meaning. 

The rating system used is as follows: the number 10 was assigned to 
the highest value of fatigue life in a given area. The other rating numbers 
in the same area were generally assigned according to the percentage of 
fatigue life they had, compared to the highest value. Some adjustment of 
the rating numbers has been made to take into consideration the orders 
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of merit. In general, the rating numbers are closely associated with the 
fatigue life of the connections involved and offer a reasonably good index 
as to how well a connection will perform in a given environment. 

Table X should be particularly valuable in selecting a connection for 
a given environment. The value of Table X is derived from the fact 
that, within a given fatigue area, valid cross comparisons can be made. 
Thus it is possible to get an idea of the degradation caused by shock on 
a percussive welded connection as compared to that caused by tempera- 
ture or humidity or some other environment totally different from shock 
on, say, a soldered or solderless wrapped connection. These cross com- 
parisons are possible because the common parameter chosen to measure 
degradation was fatigue life. 

The “composite” listed under test description in the vibration fatigue 
area of Table X was obtained from the results of the vibration fatigue 
life summaries of Section 3.2.1. 

The effect of insulation on the wire in the vicinity of a connection is 
apparent from the vibration portion of Table X. The percussive and re- 
sistance welded connections are relatively unaffected by the presence or 
absence of insulation. This result was expected, since the manufacture 
of these connections requires the insulation to be removed from the 
area near the welds. The soldered and solderless wrapped connections 
show a significant increase in their fatigue life when insulation is close 
to the connection. 

Since improved vibration fatigue life is obtained when a soldered or 
solderless wrapped connection has insulation close to the terminal, it 
seems reasonable to inquire about the possibility of manufacturing 
these connections with the insulation always close to the terminal in 
order to take advantage of the increased fatigue life. It seems unlikely 
that this could be done for the soldered connections, because of the ad- 
verse effect of the heat on the wire insulation. In the case of the solderless 
wrapped connection, however, modified wrapping bits are available 
which place a turn of insulated wire around the terminal. The average 
vibration fatigue life of these modified wraps would undoubtedly be 
greater than either the “with” or “without” insulation connections 
tested in this program. It is the opinion of the author that the modified 
solderless wrapped connections could show enough improvement in 
their fatigue life to take over first place in the order of merit as opposed 
to third place without the modified wrap. In summary, it can be stated 
that a significant improvement in vibration fatigue life could be obtained 
on solderless wrapped connections by using a modified wrap. 

It should be pointed out, however, that the need for this increased 


PERMANENT ELECTRICAL CONNECTIONS 1095 


fatigue life in normal Bell System field applications appears unnccessary 
in view of the fact that billions of solderless wrapped connections are in 
use in the telephone plant today and have given excellent service since 
their introduction ten years ago. 


IV. CONCLUSIONS 


4.1 General 


The following three conclusions apply to all four types of electrical 
connections covered in this study. These connections were made with 
one size and type of terminal (0.025 by 0.062 inch nickel-silver) and 
with one type of wire (24-gauge solid copper); consequently, the con- 
clusions may not hold for all types and sizes of terminals and wire. 

(1) Assuming adequate electrical stability, a permanent electrical con- 
nection can be characterized by its fatigue life. Thus, if the fatigue life 
of a connection is well defined for various fatigue methods and the effect 
of adverse environments is determined on these fatigue lives, the me- 
chanical quality of the connection has been established. 

(2) A bend in the wire in the vicinity of the connection partially iso- 
lates it from external wire movements and significantly improves its 
fatigue life. 

(3) The ultimate mechanical strength of a connection is a poor meas- 
ure of the fatigue life of a connection. 


4.2 Specific 


(1) Using fatigue life as a basis of comparison and soldered connections 
as a reference standard, it 1s concluded that: 

(a) monitored percussive welded connections are, in general, superior 
to soldered connections for the conditions which existed during this 
study, 

(b) over-all, solderless wrapped connections are essentially equivalent 
to soldered connections, for the conditions which existed during this 
study, and 

(c) resistance welded connections are significantly inferior to soldered 
connections for the conditions which existed during this study. 

These conclusions represent over-all averages for all of the conditions 
tested. In some specific environments they may be interchanged or re- 
versed. See Table X for details. 

(2) Table X gives a good estimate of the comparative fatigue life which 
can be expected from the four types of connections under the various 
environmental test conditions. 
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(3) All four types of connections show loss in fatigue life and destruc- 
tive strength due to repeated high level shocks. 

(4) The shock and vibration experienced by all four types of connec- 
tions when shipped by railroad express can result in loss of fatigue life. 

(5) In general, all four types of connections show some loss in fatigue 
life due to the temperature tests environment. Percussive welded con- 
nections show the most significant loss. 

(6) A significant improvement in the fatigue life of solderless wrapped 
connections can be expected through use of a modified wrap which 
places one turn of insulation around the terminal. 
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APPENDIX 


Connection Manufacture and Quality Control 


This appendix describes the destructive strength testing methods used 
for each type of connection. It also presents the data resulting from these 
tests in the form of statistical distributions. 


A.1 General Wiring 


All of the connections were made using one type of terminal and wire 
from two spools. 
The terminal was of the solderless wrapped type shown in Fig. 22. It 





' 
“ie 
p 


0.0625” 


Fig. 22 — General wiring terminal; material is nickel-silver. 
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was made of nickel silver 0.025 inch thick and {4g inch wide in the area 
of the connection. The soldered and solderless wrapped connections were 
made to this terminal in the conventional manner and the percussive 
and resistance welded connections were made on the 4g inch wide side. 
The wire used in all general wiring connections was standard switch- 
board wire, 24-gauge, solid, tinned copper wire with polyvinyl] chloride 
insulation. Toward the end of the test sequence, the first spool of wire 
was exhausted and it was necessary to use another spool. The elongations 
of the two wires, measured over a 10-inch length, were as follows: 
1. first spool 16 to 19 per cent elongation 
2. second spool 15 to 18 per cent elongation. 


A.1.1 Soldered 


The soldered connections were manufactured in accordance with the 
present standards. They were actually of the wrapped and soldered type, 
since more uniformity could be obtained by wrapping three to three and 
one-half turns on the terminal with a wrapping tool and then soldering. 

In the case of the soldered connections, there was no generally accepted 
method for determining the strength of a connection, so a testing pro- 
cedure was devised. The testing procedure consisted of mounting the 
test terminal in a universal joint type of arrangement as shown in Fig. 
23 and measuring the peak force required to pull the wire completely 
off the terminal. 

This procedure for determining the strength of soldered connections 
was evaluated by using it to measure the strength of the extreme condi- 
tions of soldered connections: (1) a good connection and (2) an extremely 
poor soldered connection made by wrapping 3 to 34 turns around the 
terminal and applying no solder to the connection. Twenty-five connec- 
tions of each type were tested, and the results are shown in Table XI. 

A total of 1040 soldered connections was manufactured for this test 
program under conditions which assured high quality and uniformity. 
Half of these were used in determining their vulnerability to various 
environmental conditions, as described in the main body of the report, 


TABLE XI — STRENGTH OF SOLDERED CONNECTIONS 
Force Reading in Pounds 


Type Connection 
Maximum Minimum Mean 





Good connection 7.5 4.8 5.76 
Poor connection 3.5 2.6 3.04 
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PERCUSSIVE WELDED RESISTANCE WELDED 


Fig. 23 — Destructive tests. 


and the other half were destructively tested according to the procedure 
outlined above. The results of these destructive tests are presented in 
Fig. 24. 


A.1.2 Solderless Wrap 


The solderless wrapped connections were manufactured according to 
present standards, using qualified wrapping bits and wire of the proper 
elongation. The operator and wrapping bit effect on connection quality 
was minimized by having half the connections made by each of two oper- 
ators, who in turn made half of their connections with one qualified bit 
and half with another. A total of 1040 connections was manufactured, 
half of which were destructively tested immediately after manufacture 
in order to determine the quality of the remaining connections, which 
were used in the comparison tests. The destructive tests were of two 
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Fig. 24 — Soldered connection destructive strength distribution, general wir- 
ing; 590 data points. 


types: (1) a standard strip-off test, shown in Fig. 23, which measures the 
maximum force required to strip the connection from its terminal and 
(2) a standard unwrap test, which requires that the wire shall be capable 
of being unwrapped completely from the terminal without breaking. 

The distribution of strip-off values for the 260 connections destruc- 
tively tested in this manner is shown in Fig. 25. 


A.1.3 Percussive Welded 


All of the general wiring percussive welds, with the exception of the 
inferior weld test, were made using the monitoring criteria developed by 
J. C. Coyne and reported in Ref. 1. These criteria resulted in welds of 
high quality, and a quantitative measure of this quality was obtained by 
destructively testing alternate welds (by order of manufacture). The 
destructive test used was the combined test, illustrated in Fig. 23, which 
consists of bending the vertical weld 90° to a horizontal position and de- 
termining the peak force necessary to break the connection. A total of 
1540 welds (not counting the inferior weld test connections) was manu- 
factured for this test program, and the destructive test data are pre- 
sented in Fig. 26. 
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Vig. 25 — Solderless wrapped connection destructive strength distribution, 
general wiring; 260 data points. 
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Fig. 26 — Percussive welded connection destructive strength distribution, 
general wiring; 940 data points. 
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A.1.4 Resistance Welded 


Although resistance welding has been used for electrical connections 
for a number of years, no generally accepted quality requirements or 
standard method of measuring the weld strength could be found. The 
resistance welds used in this test program were the best which could be 
produced after a reasonable amount of experimentation with the process. 
The destructive strength test chosen was similar to that used for percus- 
sive welds — that is, the horizontal wire was bent 90° to a vertical posi- 
tion and the peak force necessary to break the connection was deter- 
mined. This procedure is illustrated in Fig. 23. Some consideration was 
given to a test which consisted of pulling the wire horizontally along its 
original manufactured direction. This type of test yielded very little in- 
formation about the weld, however, because the connections almost al- 
ways broke in the wire, well away from the weld area. The destructive 
test chosen always broke at the weld and thus gave a much better indi- 
cation of the weld strength. 

The weld quality was evaluated by destructively testing alternate 
connections, by order of manufacture, and thus assuring the quality of 
the remaining connections for the environmental testing program. A total 
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Fig. 27 — Resistance welded connection destructive strength distribution, 
general wiring; 520 data points. 
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of 1040 resistance welds was manufactured for this test program, and the 
destructive test data are presented in Fig. 27. 
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Evaluation of Solar Cells by Means 
of Spectral Analysis 


By H. K. GUMMEL and F. M. SMITS* 


(Manuscript received February 20, 1964) 


An approach toward testing of solar cells is outlined, and a test set and 
evaluation procedure of test results are described. Outer space short-circuit 
current ts calculated from spectral response measurements performed on the 
cells. From this and additional measurements that determine the forward 
diode characteristic, the maximum obtainable power and the voltage at 
which maximum power is delivered are computed. The accuracy of outer 
space short-circuit current predictions is +2 to 8 per cent when suitable 
standards are employed. 


I. INTRODUCTION 


The solar cell is a device which converts light energy into electrical 
energy. The electrical output of the cell, or response, depends on the 
spectral composition of the incident light. For space applications, de- 
termination of the response of solar cells to sunlight not filtered by the 
atmosphere is of great importance. 

In characterization of the performance of a solar cell under outer space 
Ulumination, the most important parameter is the short-circuit current 
under such illumination. Once this current is known, the current-voltage 
output characteristic can be measured under a light source of arbitrary 
spectral distribution but of an intensity adjusted to produce the outer 
space short-circuit current. 

This paper shows that it is practical to obtain the outer space short- 
circuit current from measurements of the spectral response of solar cells. 
Of the alternative approaches, one method relies on measurements under 
terrestrial sunlight while another method attempts to simulate the 
spectrum of the sun. Each of these two methods has practical difficulties 
that offset its inherent simplicity. 

The direct sunlight measurements must be made outdoors unless 
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special buildings or sun tracking facilities are available and should be 
made at high mountain altitudes. Only a few hours around noon on clear 
days can be used for precision work. These requirements constitute a 
considerable difficulty, especially for laboratories located on the East 
Coast. Even if measurements are made at high altitudes, corrections 
must still be applied for the alteration of the spectrum and the reduction 
of over-all intensity by atmospheric absorption and scattering. 

Direct solar simulation requires a light source which is constant in 
time and which has a spectral composition equivalent to that of the sun. 
Absolute spectral measurements of high accuracy are required for 
calibration and maintenance. This problem is eased, however, if cali- 
brated solar cells are available against which the calibration of the 
simulator can be checked. 

The only information available from a solar simulator, as well as from 
direct sunlight measurements, is the total current of a solar cell; detailed 
spectral response information is not obtained. 

In the method which is described here, spectral response measurements 
are performed on the solar cells themselves. The spectral response at a 
given wavelength, multiplied by the sun’s intensity at the same wave- 
length, gives the contribution to the short-circuit current at this particu- 
lar wavelength. The total short-circuit current is obtained by integration 
of these contributions over all wavelengths. The spectrum of the sun is 
thus introduced only in calculations; the problem of building and main- 
taining a sun simulator is avoided. The accuracies with which the spec- 
tral response of the cells can be measured are, in principle, comparable 
to the accuracies with which the spectral distribution of the output of a 
solar simulator can be measured. However, if appropriately calibrated 
standard solar cells! are used, the accuracy of the outer space currents as 
determined by the test equipment can be considerably higher than that 
of the spectral measurements themselves. 

An automatic testing facility based on such principles has been de- 
veloped which, in addition to the spectral measurements, evaluates the 
current-voltage characteristic of the solar cell under test. The measure- 
ments are recorded on IBM cards to facilitate data handling. Evaluation 
of the measurements and any statistical analysis of the results is then 
easily done on an electronic computer. 

This paper gives a description of the test equipment and the testing 
procedure. First, a description of the theory is given that relates spectral 
information and short-circuit current. There follows a detailed account 
of the design of the test set. The procedure for evaluation of the test 
results is described in the last section. 
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II. THEORY 


In the derivation of the relation between the spectral response of solar 
cells and short-circuit current it is convenient to represent spectral re- 
sponse in terms of quantum efficiency, defined here as the number of 
electrons delivered into a short circuit per photon incident on the solar 
cell. This represents an over-all efficiency and includes the effects of light 
reflection at the surface, internal carrier-pair generation efficiency, and 
loss of carriers due to recombination. 

If the quantum efficiency is known, the outer space short-circuit cur- 
rent, Ico; , can be calculated by integration over all wavelengths of the 
product of the incident outer space photon flux density? ¢(\) and the 
quantum efficiency Q(X) of the cell 


Toor = Aq | QOe(A) ad (1) 


where A is the cell area and q the electronic charge. The quantum effi- 
ciency of a solar cell is a smoothly varying function of wavelength. For 
this reason it is adequate to sample the quantum efficiency only at a 
small number of discrete wavelengths. The integral can be approximated 
by a sum 


I scos = Aq 2d QAEDA: (2) 


where the photon flux entering the sum must be smoothed according to 
the intervals at which the quantum efficiency is sampled. 

The d; are the wavelengths at which the quantum efficiency is meas- 
ured, and the A; are determined by the wavelength intervals between 
these points and the integration scheme used, e.g., trapezoidal approxi- 
mation, Simpson’s rule, Gaussian quadrature, etc. 

The quantum efficiencies are measured as shown schematically in 
Fig. 1. Light from a tungsten light source is passed through interference 
filters which are mounted on a turntable. To eliminate the influence of 
fluctuations and drift in the light level, the ratio of the response of the 
cell to be measured to that of a monitor cell is taken. In terms of such 
ratios, R; , the quantum efficiency of the sample cell can be expressed as 


Q(A) = RiQret(rz) (3) 


where Q,er(A;) is the quantum efficiency of the monitor cell. The sum (2) 
can thus be written: 


i Per = 2d RlAqQret(Ai)e (Ai) Ad). (4) 
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Fig. 1 — Method of measuring ratio of quantum efficiencies. 


Defining the quantities in the brackets as “weighting factors” 


W; = A QqQret(Xi)@ (As) As > (5) 
one obtains for the outer space short-circuit current 
Livoe — DRW: . (6) 


The weighting factors include the spectral composition of outer space 
sunlight and the quantum efficiency of the monitor cell. In the deriva- 
tion, outer space sunlight served only as an example. Clearly, the method 
is not restricted to outer space illumination but can be applied, with 
appropriate weighting factors, for any illumination of fixed spectral 
composition and intensity. Practical schemes for obtaining the weighting 
factors in connection with the calibration of a set of solar cells will be 
discussed below. 

As a preliminary step in obtaining weighting factors for outer space 
current prediction, a set of weighting factors referred to as terrestrial 
weights was determined for the short-circuit current of solar cells 
measured on a clear day near noon at a high altitude. The set of solar 
cells which were to be calibrated as standards had a wide range in spec- 
tral response, as shown in Fig. 2. It included cells bombarded with 
various fluxes — up to 2 X 10°°/em’— of 1-Mev electrons and virgin 
cells. 

To find the terrestrial weighting factors, first the quantum efficiency 
of the monitor cell at the various wavelengths is measured on a relative 
scale by comparison of the monitor cell output with that of a spectrally 
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flat detector, such as a thermocouple. Combining this information with 
relative spectral intensities of the sun under which the observation was 
taken, one obtains the short-circuit current to within a common constant 
factor for all cells. This factor is readily determined from the actual 
measurements of the short-circuit currents. It is significant that only 


80 





QUANTUM EFFICIENCY IN PER CENT 


0.4 0.5 0.6 0.7 0.8 0.9 1.0 
WAVELENGTH, pe 


Fig. 2 — Quantum efficiencies of set of cells used as standards. 
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the current measurements determine the absolute scale, while all other 
measurements need to be performed on a relative scale. 

However, if the set of solar cells used in such measurements has a suf- 
ficiently wide spread in spectral response characteristics, it is possible to 
find the weighting factors even without any prior information on the 
quantum efficiency calibration and on the spectral composition of the 
sun. The appropriate weights are simply those which reproduce the cur- 
rent of all cells with the least error. 

In the present work, the terrestrial weighting factors were determined 
by a judicious combination of the two methods. Fig. 3 shows the result- 
ing comparison of the measured short-circuit current and the calculated 
current. In this figure the most heavily prebombarded cells appear at the 
lower left-hand corner. 

To convert the terrestrial weighting factors into outer space weighting 
factors, it is necessary to increase the weighting factors at each wave- 
length by the ratio of outer space solar intensity to terrestrial solar in- 
tensity at this particular wavelength. Such information is readily avail- 
able from solar spectral recordings performed by the Smithsonian 
Institution.! For the set of solar cells shown in Fig. 2 an outer space 











CALCULATED CURRENT IN MILLIAMPERES 





MEASURED CURRENT IN MILLIAMPERES 


Fig. 3 — Comparison of directly measured current with current calculated 
from spectral information. 
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extrapolation was performed by this procedure. Thus, weighting factors 
were determined for the prediction of outer space short-circuit currents 
from the spectral measurements performed on the test set. 

The calibrated cells serve as standards for the weighting factors. Thus, 
the monitor cell in the test equipment need not provide the long-time 
standard. It serves only as a short-time standard that can be compared 
readily with the calibrated cells. 

For further characterization of solar cells, additional measurements 
under tungsten light filtered through heat-absorbing glasses, later referred 
to as ‘‘white light,” are performed. These are the short-circuit current, 
the open-circuit voltage, the current delivered into a 10-ohm load, and 
the current delivered into 0.45 volts. The reverse leakage and the forward 
voltage with 50 ma passing through the solar cell are measured in the 
dark. 

In a special mode of operation the test equipment can form the 
weighted sum, thus giving the short-circuit current directly. In general, 
however, it is preferred to evaluate all the test results on a digital com- 
puter. 

The obtained outer space short-circuit current is then used in con- 
junction with the measurements under the white light to calculate the 
over-all output characteristic for outer space illumination, in particular 
the maximum-power point and the voltage at maximum power. 

In addition, the short-circuit current under the white light source is 
also calculated from the spectral response with an appropriate set of 
weighting factors. The percentage difference between the calculated and 
the measured short-circuit current is determined and recorded. Gener- 
ally, this difference is small, indicating that the outer space short-circuit 
current calculation can be trusted. Occasionally, solar cells are observed 
in which the calculated current deviates substantially from the measured 
current. This indicates that the particular solar cell has a nonlinear 
response, e.g., its quantum efficiency is light level dependent. The results 
for the outer space short-circuit current obtained on such solar cells are 
correspondingly in error. Alternative methods of calculation that are 
applicable to such nonlinear cells are described in Section IV. 


II. TEST EQUIPMENT 


As mentioned, two groups of measurements are performed in the test 
set. One group of eight measurements evaluates the spectral response of 
the cells, while the measurements in the other group evaluate the cur- 
rent-voltage characteristic of the solar cell. 
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For those tests in which the output is porportional to the incident light 
intensity, the ratio of the response of the sample cell to that of a reference 
cell is measured. This eliminates the effect of drift and fluctuations in 
the light source and the interference filters. 

The set consists of the following major components: 

1. optical system comprising light source and interference filters, 

2. mechanical system providing for optical filter transport and actua- 
tion of control switches, 

3. electronics and ratio-formation, 

4. control system, giving control commands for operation of counters 
and punch-out, and providing for the routing of the signals through 
appropriate amplifiers and attenuators, etc., and 

5. output channels: printer or translator and card punch. 


3.1 Optical and Mechanical System 


The optical system is shown schematically in Fig. 4. A 1-kw inean- 
descent projection lamp (General Electric type PH/1M/T20MP) serves 
as the light source. The light is focused by a spherical mirror (diameter 
12 inches, focal length 52 inches) and directed by a plane mirror towards 
the sample and reference cells, which are placed adjacent to each other 
in a light-tight box. To insure uniform illumination of both cells, dif- 
fusing glass is interposed into the light beam. Long-wavelength light is 
attenuated by heat-absorbing glass and nearly monochromatic light at 
various wavelengths is selected by narrow-band interference filters. 
These filters are mounted on a disk that is driven intermittently by a 
Geneva motion and interposes the filters sequentially into the optical 
path. The filters have transmission bandwidths of the order 0.01 u. For 
the evaluation of silicon cells, filters at the following wavelengths are 
used: 0.4, 0.45, 0.5, 0.6, 0.7, 0.8, 0.9, and 0.95 uw. For measurements on the 
solar cells in the dark, four positions are blocked; the remaining four 
positions admit white light for measurements of additional solar cell 
characteristics. The white light intensity is adjusted to generate in a 
typical solar cell a short-circuit current that is comparable to the short- 
circuit current expected in outer space. 

The important factors of the mechanical system are also shown in 
Fig. 4. A 1-rps motor drives the Geneva motion, which advances the 
filter disk. Thus the time available for any individual test is 1 sec. The 
motor axle contains a set of adjustable cams that actuate microswitches 
for the timing of various functions within a single test cycle. The wipers 
of a set of 16-position rotary switch decks are driven by the filter disk 
axle to permit the selection of the different types of tests to be performed. 
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Fig. 4 — Optical and mechanical system. 


(3.2 Electronics and Ratio Formation 


The various signals encountered are measured by a combination of 
voltage-to-frequency converter (Dymec 1122B) and counter. This com- 
bination, when used in both the sample and reference channel, makes 
fast and accurate ratio formation possible. 

If the input voltage at the sample channel is V, volts (see Fig. 5), and 
if the voltage-to-frequency converter (vfc) gives A cycles per second 
per volt, the total count N, obtained in ¢ seconds is 


N, = V.At. (7) 
Similarly one obtains in the reference channel 


N, = V,At. (8) 
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Fig. 5 — Method of ratio formation. 
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RATIO FORMATION: Vs/ Vp 


To form the ratio V./V, the reference counter is used as a scaler, in 
such a way that it gives an output pulse for every 1000 input pulses. The 
time between output pulses is then 


1000 


ae ae 


(9) 





The output pulses from the scaler gate the counter in the sample 
channel in such a way that one pulse turns it on and the next one turns 
it off. Thus ¢ in (7) is identical to that in (9). This leads to 


N, = 1000 (V./V,) (10) 


i.e., the count in the sample channel counter gives the desired ratio of 
the voltages with adequate resolution. 

The voltage-to-frequency converter and counter combination, when 
used as a voltmeter, gives an output that represents the input voltage 
averaged over the time of measurement. Thus fluctuations are smoothed 
out. The effective noise bandwidth is therefore the reciprocal of the time 
of measurements. In spite of the small noise bandwidth, however, this 
combination has a fast transient response. Thus, high-resolution, low- 
noise measurements during short time intervals are possible. With a 
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wideband amplifier (Kintel type 112) preceding the voltage-to-fre- 
quency converter, the equivalent noise at the input that actually has 
been observed is of the order of 1 uv for 0.1-second measurement in- 
tervals even after an estimated 750 hours of operation of the system. The 
chopper at the input of the preamplifier deteriorates with time; for a 
new chopper the noise may be considerably below 1 pv. 

The signals to be measured are in the range 0.5 mv to 10 mv except 
for the spectral response measurement at the shortest wavelength, 0.4 y, 
where the output of the light source is low and the signal is about 100 uv. 
For all other wavelengths the amplifier noise thus contributes less than 
0.2 per cent to the measurement error. Since the contribution of the 
0.4 » spectral region to the total short-circuit current is small, the 
error contributed by the amplifier to the total short-circuit current is 
also below 0.2 per cent. 

The short-circuit current is obtained from the measured voltage drop 
across a shunting resistor. The resistor has to be of sufficiently low value 
that the voltage developed across it would result in a forward current in 
the dark that is negligible compared to the short-circuit current. For the 
spectral measurements, a resistor of 10 ohms is adequate, while for the 
white light measurements a 0.1-ohm resistor is used. The switching of 
resistors at the low signal levels has to be done carefully to prevent in- 
troduction of spurious thermal emf’s. Mercury relays are employed in an 
arrangement shown in Fig. 6. For the measurement of the open-circuit 
voltage (3 volt) and other voltages of comparable magnitude, the pre- 
amplifier is bypassed and the signal is fed directly into the voltage-to- 
frequency converter. 


3.3 The Control System 


The programming of the various tests is accomplished through the 16- 
position switch coupled to the axis of the drive motor. Microswitches 
control reset commands at the beginning of each test; approximately 
in the middle of each test, (after the amplifier transients have died down) 
a new counting cycle is initiated. The last operation within each test is 
the print command to the printer or IBM punch. 

The switching of the signal paths for the various tests is accomplished 
by relays which are activated from the 16-position switch through a 
diode matrix as shown in Fig. 7. 

The following quantities are measured in the tests: in the eight spec- 
tral readings, the ratio of the short-circuit currents of sample cell and 
reference cell at eight wavelengths, 
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Tig. 6 — Switching of signal paths. 


Trey , the current flowing through the cell when a reverse voltage of 
5 v is applied through an 800-ohm resistor, 

V., the voltage developed across a load resistor Rk, (10 ohms) when 
the cell is illuminated: by white light (see optical system), 

V;, the voltage across the cell when a forward current J; , normally 
50 ma, is passed through the cell in the dark, 

I,,, the short-circuit current measured directly under white light. 

Iscr , the ratio of the short-circuit current of the sample cell under 
white light to that of the reference cell (In this measurement the ampli- 
fier gains are set such that for proper adjustment of the light source the 
numerical value of Iscr is the same as that of J, .), 

I4s, the current delivered by the cell under white light illumination 
into a 0.45-volt voltage source, and 

Ve, the open-circuit voltage of the cell under white light illumination. 

The short-circuit current is measured both directly (U,.) and nor- 
malized (ser). In the linearity test, the normalized current is to be used, 
as it is compared with the normalized spectral readings. In the evalua- 
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tion of the output characteristics, the unnormalized short circuit current 
is used in conjunction with the open-circuit voltage and the voltage 
across a load resistor. 


IV. EVALUATION OF THE SHORT-CIRCUIT CURRENT 


The short-circuit current is evaluated from the raw data according to 
(6). This evaluation can be performed either directly by the test equip- 
ment concurrently with the measurement of the R; or by a separate 
calculation, preferably on an electronic computer. 

For the evaluation in the test set the counter is not reset between the 
spectral response measurements P;. The multiplication by W; is carried 
out by interposing attenuators between the preamplifier and voltage- 
to-frequency converter of the sample channel. Such attenuators are 
switched by the 16-position switch, which is coupled with the axle of 
the filter disk. The number added into the counter at each filter position 
equals the current contributed by the wavelength band represented by 
the filter, and the final number on the counter gives the total short-circuit 
current. 

The direct test set evaluation of the short-circuit current is of impor- 
tance if no computing facilities are available or if results are needed 
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Fig. 7 — Diode matrix for activation of relays. 
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immediately. On most other occasions, it is preferable to have the com- 
puter process the raw FR; data, since one set of data can then be analyzed 
for cell response under various light sources. 

In addition to providing short-circuit current under outer space il- 
lumination, the computer program, as used for routine evaluation of 
solar cells at Bell Telephone Laboratories, evaluates the short-circuit 
current for the white light that is used in some of the tests of the output 
characteristic, as discussed in Section II. For a check of consistency, the 
program then computes the percentage deviation of this current from the 
short-circuit current measured directly under white light. This check is 
important for the detection of cells whose short-circuit current changes 
nonlinearly with light intensity. 

As an additional feature, the program computes the ratio of the com- 
puted outer space current to the computed white light current. This 
quantity, CC, is of interest for a number of reasons. If the maximum in- 
tensity of the white light falls at a different wavelength than the maxi- 
mum intensity of outer space sunlight, then CC provides a “color index” 
for the cells. In the test set described here the white light is deficient in 
infrared and red light due to filtering with heat-absorbing glass. There- 
fore, red-sensitive cells have large CC numbers while blue-sensitive ones 
have smaller CC numbers. 

In the standard procedure for calculating the outer space short-circuit 
current, the measured R; values are multiplied by proper weighting 
factors W; and summed as in (6). Fluctuations in intensity and spectral 
composition of the light source cause no first-order error, since the R; 
are ratios. However, if the cell current varies nonlinearly with light in- 
tensity, the current at full solar illumination, as calculated from spectral 
measurements at low light intensity, is in error. From the comparison of 
measured white light response and calculated white light response one 
knows whether or not a cell is linear. Nonlinear cells are encountered 
rarely enough so that they present no serious problem. Nevertheless, the 
program can be used — if required — in such a way that errors due to 
cell nonlinearity are minimized; however, errors will be introduced if the 
spectral content of the white light changes. The procedure is to multiply 
the measured white light current I.2 by the factor CC. Since nonlineari- 
ties cause errors in both numerator and denominator, the influence is re- 
duced to second-order effects. 


V. THE OUTPUT CHARACTERISTIC 


As discussed before in connection with the short-circuit current for 
outer space illumination, the output characteristic of the solar cell could 
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be established under a light source of arbitrary spectral composition but 
adjusted in intensity to produce the known outer space short-circuit 
current. To do this, however, would be a very tedious and time-consum- 
ing procedure. As an alternative, one measures the output characteristic 
under a light source that generates a short-circuit current close to, but 
not necessarily equal to, the outer space short-circuit current. It then is 
possible to calculate the outer space response from these measurements 
and the known outer space short-circuit current. 

For such calculations of the output characteristic the following model 
of a solar cell is used: 


q(V + IR) | _ 
I= Tse — Io {exp [ev | i}, (11) 


Here J is the output current at voltage V, J,, is the light induced cur- 
rent (under the given illumination) and corresponds to a current gener- 
ator in parallel with the solar cell diode. R, is a lumped resistance ap- 
proximating the contact resistance and sheet resistance in the front 
layer of the cell. 7, , the diode forward-current constant, is orders of mag- 
nitude smaller than J,, ; therefore the term (—1) in (11) can be neg- 
lected. With this simplification one can express J, in terms of the open- 
circuit voltage, V.., i.e. that voltage for which the left side of (11) is 
zero, which leads to 


qV+ RI —- Yo) (12) 


Pised E — exp kT 


An ideal diode would have a value of 1 for the coefficient n in the 
exponent. A number of effects” cause deviations from this simple be- 
havior, and frequently the output curve cannot be fitted by a constant 
value of n over the entire voltage range and for different light intensities. 
However, one may fit the output characteristic over a limited range near 
the maximum power point with a constant » for a particular light in- 
tensity. Only small errors are introduced if one uses the same value of n 
near the maximum power point under slightly different illumination. 

Similarly the value of the resistance R, in (12) is not strictly a con- 
stant, since it involves a spreading resistance in the thin diffused layer. 
Nevertheless, in general R, will be a very weak function of current and 
light level so that, for this calculation, it can be treated as a constant. 

With these assumptions the output characteristic under outer space 
illumination is determined by the four quantities J,,, V.., n, and R,, 
which must be established by measurement. The first two of these de- 


* Among these effects are space-charge recombination and sheet resistance. See 
also Ref. 3. 


1118 THE BELL SYSTEM TECHNICAL JOURNAL, MAY 1964 


pend on the particular light level. From the measurements of J, , Voc , 
and V, under the white light, and from the measurement of V; in the 
dark, all four quantities can be extracted. By replacing J, in (12) by 
the known outer space short-circuit current J,,., and setting J = 0, one 
obtains the open-circuit voltage V,..; for outer space illumination. The 
parameters Iscos , Vocos , 2, and R, are now used to characterize the output 
characteristic of the cell under outer space illumination. This permits a 
calculation of all quantities of interest according to (12), in particular 
the maximum power point. An approach for performing these calcula- 
tions on an electronic computer is given in the Appendix. 

Most of the measurements on the test are reproducible over periods of 
hours with an rms deviation from the mean of about 0.5 per cent. If 
calibrations on the test set are to be made to an accuracy approaching 
0.5 per cent or less, the fine adjustments become very tedious, as the 
criterion has to be the result of a statistical analysis of several measure- 
ments. 

A preferable procedure is to make adjustments on the test set to a 
moderate accuracy and to have the computer apply fine corrections. 
Two modes of operation are used. In one mode these corrections are ap- 
plied manually as parameters that are entered with the data. In the 
other mode, a set of standard cells is measured along with the cells to 
be evaluated. The data cards of the standard cells also contain their 
calibrated outer space short-circuit current value. The computer can 
then determine the percentage deviation of the calculated value of the 
outer space short-circuit current from the calibration value for each of 
the standard cells. The calculated values of J;..; for the cells under test 
are then corrected on the basis of the average of these deviations. A 
similar correction is applied to the white light current. 

With the latter mode of operation, the long time standard is provided 
by a group of standard cells, and the built-in monitor cell serves only as 
a short-time reference. If there are no uncertainties introduced by long- 
time drifts of the standards, absolute accuracies of outer space short- 
circuit current predictions of 2 to 3 per cent should be realizable. Com- 
parison of preflight predictions with flight data on the Telstar satellites 
and the Anna 1B satellite* confirm that an accuracy of 3 per cent or 
better has been achieved. 
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APPENDIX 


A.1 Method of Calculation of n and R, 


The quantity n is determined by fitting the output characteristic be- 
tween V,; and V,,. As one has to make allowances for R, , the series re- 
sistance, an iterative scheme is convenient. Assume for a moment that 
the series resistance /?, is zero. One can then compute the value of n or 
nkT/q from (12) 





nkT = Ve a Vi 
i ie (1 _ Vn ) (13) 
Ril sc 


Using this value of nk7'/¢ one can compute the value of the voltage 
across the cell for a current J,, — J; through the illuminated cell 


Ve a Veg te Oke In (74) . (14) 
q Ls 
If there were no series resistance, then the measured voltage V; should 
be equal to V.. Because of the series resistance, the voltage V; is higher 
by R,:-J;. One thus obtains an initial estimate of the resistance 
R, = Vig a Ve. (15) 
[; 
Now this value of R, is used for an improvement of (13). During the 
measurement of V;, current is flowing out of the cell, and the junction 
voltage is the measured voltage V, increased by the voltage drop 
(V,/R1)/R, . Thus one obtains 


fi, 
q es s 


Vi 
In (2 Rule 1.) 


The new value of nkT'/q thus obtained can be used to compute an im- 
proved value of V., and in turn, of R, , and so on. The convergence of 
this procedure is quite rapid, since nkT/q in (14) is multiplied by 
In (I;/Is-¢) and I; was chosen to be near I,, . 


A.2 Methed of Calculation of Outer Space Quantities 


Using the quantities J,,, Voc, (nkT/q) and R,, as determined in the 
previous section, the outer space quantities are evaluated as follows. 
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Outer space open-circuit voltage: 


eet (17) 


qd Le 


Voce = Voc a 


For further evaluation of the charactcristics it is convenient to introduce 
the following normalizations: 


qV 














ao nkT' (18) 
u= fl 
T secs 
Pebe ee 1 
rel Rios — I = 1 —U (19) 
— WV oc0s 
° nT (20) 
= Qh scos 
B nkT (21) 


The normalized current u at the normalized voltage y is obtained from 
a solution of the equation 


u=1— eter’, (22) 

In the computer program, this equation is again solved by iteration. 
The power at the voltage corresponding to y is now given by 

P = (nkT/q)I scosytt- (23) 


To obtain the maximum power, (22) is rewritten so that the normal- 
ized voltage appears as a function of the normalized current 


y=z2-—fButin(1 — vw). (24) 
The normalized power 7z is obtained by multiplying (24) by u 
a = ulz — But+ In (1 — w)). (25) 


The maximum is obtained by equating the derivative to zero. This 
yields the condition 
g= —~— + 26u — In (1 — u). (26) 


1l—u 





In (25), u is required as a function of z and 8. Again, on the computer 
a solution is conveniently found by iteration. To get (25) into a form 
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suitable for this, a switch is made to the variable x, defined in equation 
(19). Then 


r=2+1-2(1-1)- Ime (27) 


and in this form the equation can be solved by iteration, starting with 
= 1 on the right-hand side. One now obtains for the maximum power 


P ase = L cag V cies Cond (1 == ey, (28) 
and for the voltage at which maximum power is delivered 
Vp =" v1) (148). (29) 
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This paper describes the theory of a unilateral parametric amplifier which 
contains two variable-capacitance diodes separated by a quarter wavelength 
at the signal frequency and a half wavelength at the idler frequency. It ts 
shown that a broadband signal circuit 1s essential in order to obtain unt- 
lateral gain, and that matching conditions are obtainable even with a high 
gain. The optimum noise figure is slightly worse than that of a single- 
diode reflection-type amplifier; however, this amplifier has advantages wf it 
ts refrigerated at liquid helium temperature, because it does not require a 
circulator in front of its input port. The amplifier usually requires an 
isolator at tts output port, since it does not have substantial loss in the 
reverse direction. 


I. INTRODUCTION 


The performance of a single-diode reflection-type parametric amplifier 
is often limited by the availability of a good circulator, which is essential 
to a practical amplifier. This becomes a more serious problem when the 
amplifier is to be refrigerated down to liquid helium temperature, since 
satisfactory circulator performance is then difficult to obtain. 

A unilateral parametric amplifier with two diodes, originally proposed 
by Baldwin,! does not require any circulator or isolator in front of the 
amplifier if the signal source impedance is reasonably well matched. 
Therefore, this amplifier might avoid the difficult circulator problems. 

This paper is prepared to show the theoretical characteristics of a 
unilateral parametric amplifier with two diodes separated by a quarter 
wavelength at the signal frequency. In Section II, an exact expression 
for the scattering matrix of a simplified model of the amplifier is ob- 
tained. In Sections III and IV, expressions for power gain, noise figure 
and bandwidth are calculated. In Sections V and VI, the optimum noise 
figure and some design considerations of the amplifier are described. 


1123 


1124 THE BELL SYSTEM TECHNICAL JOURNAL, MAY 1964 


II. PRESENTATION OF THE SCATTERING MATRIX OF THE AMPLIFIER 


Fig. 1 shows a basic configuration of the unilateral parametric ampli- 
fier. The black box shown in Fig. 1 is a symmetrical lossless reciprocal 
two-terminal-pair network whose image impedance Z and phase con- 
stant 6 are assumed to be real quantities. Stationary susceptances of the 
diodes are considered as a part of the black box, and losses in the diodes 
are included in the external loads. C' and C’ in Fig. 1 represent the sinus- 
oidally varying shunt capacitances which play an essential role in the 
mechanism of amplification. Parameters concerned with the right-side 
arm of the black box are indicated by primed letters, and those of signal 
and idler frequencies are indicated by suffixes 1 and 2 respectively. 





Vig. 1 — Basic configuration of the unilateral parametric amplifier. 


Currents and voltages of the varying capacitance C are related by the 
following equations: 
421 => jon (6/2) v2" 
i a (1) 
ten = — Juo(e'/2)y 


where w is angular frequency and the asterisk indicates a complex 
conjugate. The terminal currents and voltages of the capacitance C’ are 
expressed in terms of incident and reflected waves a, b, a; , and b; normal- 
ized by Z as follows: 


v= ~VYZ(at b) = VZ(a; + bi) 
i= Sy la-d) (2) 


ti _ 77 (a: = b;). 
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The directions of flow of these waves are indicated by arrows in Fig. 1. 
The continuity equation of current at the terminal C is given by 


P= t+ 7;. (3) 
Combining (1), (2), and (3), we obtain the following equations: 
ay = a joka” = jonide” 
ba = b) + Jerky” = jerkbo™ 
Aig = Ag + Jwok Ay + Jo by 
ee = bs = jerk ay — joi by 
where « is a complex number defined as: 
ZZ. 
5° 
Equations similar to (4) are obtainable for the varying capacitance C’ 
by replacing unprimed parameters by primed ones in (4). 
Waves of the lossless reciprocal two-terminal-pair network are related 
by the following equations. 


: 





(5) 


K= 


ol Es 


b; = ea! 

b,’ — e a; é (6) 
Substituting (4) and the similar equations for C’ into (6), we obtain the 
following set of equations: 


Sy a ae a ee = ee ee er ee 
by + jorkbe” + jurk’e "be = ear’ — jorkag” — jark’e "ar! 
eo ee: a Sata —j0 ag go 
bY + jae 7b." + jonk’be’” = € a, — joe 7" 


x. if “yee BR: x . ak . pk 96: Oo pk 
—jJunk by — jook’ eb + be” = jak” ay + jak’ e” ay’ + e7 7a,’ 


* . “7 pk 
Az, — JW1K Ae 


(7) 
— junk *e7*by = jerok! by + bs* = joke ay -+- jek! ay ++ e7q,* 
Assuming the same magnitude for the complex quantities « and «’, and 


solving (7) with respect to bj , by’, bs", and b/", we obtain an exact form 
of the scattering matrix for the circuit shown in Fig. 1. The matrix is 


as follows: 
by Su Sp Sis Siu ay 
bi’ ae S21 Soe Seg Sag ay’ 
~ (8) 


ie A Sse S33 Sys | 
be!” Sa Se Sis Sas a 
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Su-A = SoA = 2wwox” exp (—76:) [cos 6, + exp (72) cos 6, 
— 4w7we'k® exp [j(62 — 41)] sin A sin 6. 
Ss3°A = Sage A = Qoryexx” exp (762) [cos 02 + exp (—761) cos Oy] 
— 47k exp [7(6 — 61)] sin 01 sin 6 
Sw-A = exp (—7j61) {1 + j2ww2K” 
-exp [j(6 + 6))] sin 61} 
Soy-A = exp (—JO,) {1 + j2awox” 
-exp [j(62 — Op)] sin 61} 
Ss4-A = exp (j62) {1 — J2wwon 
-exp [—Jj(@1 + 4)] sin 62} 
Saye A = exp (j02) {1 — J2awox” (55 
-exp [—j(1 — p)] sin 6} 
Si3-A = — (a1/a2)Se-A = —Joyx exp (76,/2) 
-{1 + exp [j( — 01 — O)] — Seer” 
-exp [j(02 — 61)] sin 6, sin 62} 
SoA = — (a1/w2) S31-A = —jox exp (—76p/2) 
-{1 + exp [j(0 — 01 + Op)} — Seren’ 
-exp [7(@2 — 6,)] sin 6: sin 62} 


Su: A = — (wi/a2)S3-A = —jow exp (j0p/2) {exp (762) 
+ exp [—j(@ + 9)]} 
So3°A = — (1/2) Sa-A = —Jjorx exp (—J6p/2) {exp (J62) 


+ exp [—j(A — 6)]} 
and 


A = 1 — 2wywox"(1 + exp [j(62 — 61)] cos 4) (10) 

+ Aw work exp [7( 0, = 6,)] sin 6; sin Ao 
where x and 6, are real numbers, and represent the magnitudes of « and 
x’, and a difference of pump phases at two diodes respectively as shown 
in the following equations: 
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(0/2 
xe’ pl 


j05i2 (11) 


= 
k’ = Ke 
Assuming the following phase relationships 
A = (2m + 1)(a/2) 
6 = nw (12) 
6, = (2m + 2n + 1)(2/2), 


we obtain the following special relations for the waves: 


by = (=) ay’ 


mtis 1 + arnwer” pai emer” 
by SS (a ee = ee 
: rg 1 — 2wwox? i ae 1 — Qwrwox? 
ee Qurxe 7”? he 
1 — 201 00K? : 
bo* = 7 Qunxe VP Qwywor ity 1 an! (13) 
: J 1 — 2wiwor? > 2w1wek? ‘ 1 — 2wiwox? ° 
. Lore 7??? 1 
b ly = Road n rs n 
: ( ) J 1 — 201K? o + ( ) 1 — 2w1woKk? a 
QaywoK” Is 
. 1 — Qwywox? .? 


In (18) it is found that signal waves are completely matched in each 
direction, and that the signal incident wave a;’ propagates from the 
right to the left with no gain nor loss and has no interaction with the 
other three waves. When both idler ports are properly terminated, the 
transducer gain of this amplifier for the incident wave a;— that. is, 
| Sor |” — equals the square of the ratio (1 + 2wywok")/(1 — 2u1wox”). If 
the surge impedance of the circulator of the reflection-type amplifier at 
the signal frequency equals the image impedance Z, and the idler load 
impedance at the idler frequency equals the image impedance Z,, the 
gain formula is identical to that of a single-diode reflection-type ampli- 
fier, | (1 + 4ewox”)/(1 — deux") |), except for a factor of 4 in the 
pumping term 2w,«°. Thus, a perfectly matched and unilateral para- 
metric amplifier can be achieved by means of the phase synchronization 
shown in (12). 

If all frequency characteristics of 4,, 62, Z, and Z». are known, it is 
possible to calculate the frequency characteristic of the amplifier from 


1128 THE BELL SYSTEM TECHNICAL JOURNAL, MAY 1964 


(9) and (10). Since this method is rather tedious, a simplified method 
will be developed in the following sections. 


II. EFFECTS OF REFLECTIONS, BANDWIDTH 


For the sake of analytical simplicity we first assume that phase con- 
stants are frequency independent and satisfy the condition given by 
(12), but we introduce the frequency-dependent reflection coefficients of 
the external loads. These reflection coefficients can be modified to include 
the effects of frequency-dependent phase constants. 

The incident and reflected idler waves are related by the idler reflection 


coefficients I, and I.’ as follows: 
* *, * 
a = T, be 
* Bo * 
Qu! => T,’ b,! z 


(14) 


Substituting these relations into (13) and eliminating a2", a2’*, b.* and 
b.’* from the equation, we obtain the following equations: 


by = Sye’ay’ 
bi = Soa oe, 
where 
Sy! = (-)""5 
a! = (| 
(16) 


l/l 1 
Swwor” -+- 3 (4 -+- aa) 


19 s 9 : 
(2a _ A Rowe) (2a — La ieee es | 


-++ 


Equation (15) shows that any reflection in the idler circuits does not 
deteriorate the unilateral characteristic of the amplifier. (A change in 
idler phase, 62, however, causes deterioration in the unilateral charac- 
teristics of the amplifier: see Appendix. ) 

Next let us consider a special case in which the amplifier has sym- 
metrical idler terminations, T.’ = I. Substituting this condition into 
the second equation of (16), we simplify the equation as follows: 
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1+ QurrcsoK” _ Tt 

SS ee (17) 
1— 1 1 + 2a iwox 
1 — awe? 


The reflection coefficient T, is expressed in terms of the image impedance 
Zz and the idler load admittance Yr. as follows: 


_1-— Yrkp 
tS TF Ved’ ue 
where 
Yrn= Gn + jBrm. (19) 


This admittance may include parasitic elements of the diode. Substitut- 
ing (5), (11) and (18) into (17), we obtain the following relation: 


2 
m+1+ Yro* + W102 | c | Ly 


So = (-)""79 Vato eka (17°) 


This equation shows that the idler image impedance Z, does not affect 
the gain. Only the idler load admittance, Y 72 , is of primary importance 
for transmission gain of the amplifier. Therefore, we can assume here- 
after that the idler image impedance equals 1/G'72 in order to avoid the 
reflection at the center frequency of amplification. The square root of 
the power gain at the center frequency, ~/ PG, is given by the following 
relation: 


Gre + ZW1We | Cc |’ “1 
= 9 = 2 
Ve ea teenie rT oa 
and a half-gain bandwidth, (Af)sa», is approximately determined from 
the frequencies where the susceptance component of the idler load equals 
the denominator of (20), 1.e., 


| Bro | = Gr — FO1W. lc ’Z1 (21) 


where the right-hand side of (21) is a slowly varying function of fre- 
quency in comparison with By . 

Now let us consider effects of small reflections in the signal circuit. 
We assume that both the signal source and load admittances are repre- 
sented by (1/Ro) + 7B: as shown in Fig. 2, and waves ao1 , bor , dor’ and 
bo:’ are normalized to Ro instead of Z, . Expressing voltages and currents 
in terms of waves as was done in (2), we obtain the following equations 
from the equations of continuity of currents and voltages: 
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a= (ht "a Ju) or ae (hy — ju; )bo1 


(22) 
bj) = (4 + Jur) aor ig (ky + Jur) bor 
where 
1 Ro Zy 
ee fvoL ba 
a z( Tee Zi 
_1f, /Ra Za 
as 3( ZV Ra ee) 
B ees 
k= > V/Ruti : 
Substituting (23) into (15), we obtain the following relations: 
bo = oy ae + Si aor’ 
(24) 
bo’ = So aor + Soo aor! 
8 = So = key = Jus. —Ty* + Sy’ So'T; 
ky + jor 1 — Sy'Sa/T? 
ky — ju. Sw'(1 — [Tr |) 
Sn Se 25 
7 ky + jour 1 — Sp'Sn/TP 20) 
o _ ka — Jur Su’ (1 — [TL |) 
So1 ee est EES ee 


ky + jm 1 — Sp’ Sa'T? 


where Tj is a reflection coefficient of the signal source and load normalized 
to Z, , and is defined as follows 


1 1 
: See Ge B 
unin Amt) gg 
ky + jm z+ (qi + 3B) 
“1 Ro aos 


Equation (25) reveals that any reflections in signal circuits cause an 


1 


«by = hb, b;—» boi 
4o,—> ay—> <=} «9, 


A 
21] SHOWN IN |Z, 
FIG. 1 





Fig. 2— Forward and reverse waves in the signal circuits. 
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Fig. 3 — Normalized forward gain for various values of a?/g vs normalized 
idler reflection coefficient gy. 


internal feedback and deteriorate the unilateral characteristic. To sim- 
plify the analysis, we assume that Ty is proportional to T, and that both 
of them are small imaginary numbers, ie., Tz = jy2, U1 = jove, | Ti |? « 
1, and | I, | < 1. Substituting the first equation of (16) and (17) into 
(25), we obtain the following scattering matrix elements, which charac- 
terize the frequency dependence of the amplifier if g > 1: 

reflection: 


esa 23 | Sox | pel ora 
{1 = 9 (on) + (gy2)? 


reverse gain: 


1S | V1 + (972)? 
. ee) 2 (27) 
at ae (on) + (gy2)? 


Sa | ~ g 


1-5 wy} + (oy) 





forward gain: 
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Fig. 4 — Reverse gain for various values of a?/g vs normalized idler reflection 
coefficient gre. 


The characteristics of these elements with various values of the parame- 
ter a’/g, as a function of gy2 , are shown in Figs. 3, 4, and 5. 

It is shown that the amplifier has a maximally flat gain characteristic 
when a’/g = 4, as follows: 


| oo | ~ eens eee : 
(yv2)* (28) 
owe 


A half-gain bandwidth is determined from the frequencies where the 
following relation is held: 


love | & V2. 


Similarly, for the amplifier with a matched signal source and load, the 
bandwidth is derived from the following equation: 


| ov2 | 1. 


Therefore, by introducing a proper mismatch in the signal source and 
load and assuming that 72 is proportional to a frequency change, we can 
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obtain about +/2 times larger bandwidth than that of the matched 
amplifier. But this enlarged bandwidth is obtainable only at the expense 
of deterioration of unilateral characteristics, as shown in Fig. 4. 


IV. EFFECTS OF LOSSES; NOISE FIGURE 


In the previous discussions, losses in the diode have been considered a 
part of the external circuits. However, in order to study the noise per- 
formance of the amplifier the diode losses have to be separated from the 
external circuits. 

In Fig. 6, Ga and G.’ represent the equivalent loss conductance of the 
diodes, and they are accompanied by the noise current generators tna 
and tna’. Gr, and G, represent the load and signal source conductance, 
respectively. 

In order to eliminate feedback effect due to mismatches at the signal 
ports, we assume that the signal output port is perfectly matched to its 
image impedance Z,. Thus, we can eliminate any feedback effect even 
if there is mismatch at the signal input port. The matching condition at 
the signal output port is expressed as follows: 


(Gr + Ga’)Z = 1. (29) 


In order to feed the maximum power into the black box A shown in Fig. 6, 
the signal source impedance has to be matched to the input impedance 
of the amplifier, which includes the diode loss conductance Gz. This 
matching condition is expressed as follows: 
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Fig. 5 — Normalized input reflection coefficient for various values of a?/g vs 
normalized idler reflection coefficient gy2 . 
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Fig. 6 — Noise sources of the unilateral parametric amplifier. 


G;Z1 = 1 + GuZy e (30) 


Note that the condition in (80) does not necessarily mean that the signal 
source impedance is matched to Z; . 
The maximum power available into the black box A of Fig. 6 is given 
by 
P (ee P a 
GZ. 1+ GaZ,’ 





laf = (31) 
where P, is the maximum available power from signal source. The power 
delivered to the load, denoted by Po, is given by the following equation: 


Po = G12, | by’ A (32) 


Therefore, from the second equation of (13), and assuming no input 
signal at the idler frequency, we obtain the following gain expression: 
Po GZ 2 

PG =5 -T1LG@g,! (33) 
where g is given by (20). [In (20) we assumed that the idler loads are 
not necessarily matched. However, as stated there, we can assume the 
matched idler loads without loss of generality.] This gain expression is 
very similar to that of a single-diode reflection-type amplifier, as men- 
tioned in Section II. 

Assuming noise sources are thermal and the temperatures of the am- 
plifier, the signal source and the load are the same, denoted by T° 
Kelvin, noise currents and noise wave are expressed in the following 
relation: 


Wee ex tad 

4G, 4G,’ 

where k = 1.38 X 10°” joule/°Kelvin is Boltzmann’s constant and Af 
is the infinitesimal bandwidth concerned. 

Though we have a mismatched signal source with respect to Zi, the 

difference between the noise power from a matched signal source and 

that from the mismatched signal source is exactly the same as the noise 








=|a'/ =|a@| =|’ f= kT af (34) 
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power reflected at the terminal of this mismatched signal source. This 
reflected noise power is originally from the perfectly matched signal out- 
put network through the amplifier without gain nor loss in the reverse 
direction. Therefore, the resultant incoming noise wave into the 
amplifier is the same as mentioned in (34) 


a, |? = kTAf. (34’) 
Substituting (34) and (34’) into the second equation of (13), we 


obtain 
lb’ P= (1 ees (1 = *) kT Af. 
We g 


Therefore, the noise output power due to | bi’ |)’ is given by 
Peaeae | eet (1 = 5) kT Af. 
Wwe g 


The directly radiated noise power into the load from G,’ is also the other 
part of the noise output power. We obtain the following equation for 
this noise output power: 


Poe = G,21G7¢Z2,kT Af. 


Therefore, we obtain the following expression for the noise figure of the 
amplifier: 


pa Pat Pe 
PG)kT A 
(PORT Af - = 
= 14 Gia) {1+ SF 4 a — Hh 
g We g 


It is shown in (35) that GyZ, and w/w. should be smaller in order to 
obtain a better noise figure. And the expression for the noise figure is 
very similar to that of a single-diode amplifier. Note that (35) was ob- 
tained on the assumption that the temperature of the resistive load 
or isolator after the amplifier is 7° Kelvin, and the noise figure is also 
normalized to T° Kelvin. If the load temperature is 0° Kelvin, we have 
to subtract G7Z/G1Z,/(1 + GiZ,) from the right-hand side of (35), 
which comes from the noise power from the load at T° Kelvin. 

If the load or isolator temperature, 7; , is higher than 7’, the noise 
power generated in the isolator travels toward the input side of the 
amplifier, reflects back at the input diode, and is amplified. Therefore, 
the higher the isolator temperature is, the smaller reflection must be 
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kept at the input diode. If the isolator temperature is much higher than 
the amplifier temperature, a matched condition to Z; at the input diode, 
i.e. 


G21 + Gah, 1 (30’) 


must be held in order to keep the effect of the hot isolator small. In this 
case, the power gain and the noise figure normalized to T° Kelvin are 
approximately obtained in the following equations: 








PG GnZiGZg (33’) 
Le 14 SF 4 _ 1) 
1 — Gai gy? we g (35’) 
fe Z ee te 
+ We i (4.21 + GaZy = 1) if ca > 1. 


The last term in parentheses represents the deterioration of noise figure 
due to input mismatching. 


V. OPTIMUM NOISE FIGURE OF THE AMPLIFIER WITH LOSSY DIODES 


To simplify the analysis, we assume that a variable-capacitance diode 
is represented by an equivalent circuit shown in Fig. 7(a). The junction 
capacitance C; is the only variable element and is given by the following 
equation: 


C; = C; + € COs apt (36) 


where w, denotes the pumping angular frequency and equals w; + w. 
The characteristic quantities of the diode, w,, , w) and Qp are defined as 
follows: the critical angular frequency 





where Q is a dynamic quality factor of the diode; the self-resonant angular 
frequency 
1 


ao = V/L.C; ; (38) 


and the diode Q at the resonant frequency 


D, 
Qo = F (39) 
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Co Cj JBd 





(a) (b) 


Fig. 7 — Equivalent circuits of a variable capacitance diode. 


Transforming the circuit of Fig. 7(a) into the form shown in Fig. 7(b), 
where Cerp IS an equivalent sinusoidally varying shunt capacitance, the 
new circuit parameters at frequencies away from the self-resonant fre- 
quency fo are given by the following relations: 








7 R.Qo 4 C3 wo @ Wo 
WO (63) 
1 1 
Get 
: RQ? (w wf (40) 
Wo (é3) 
2 
oo 
Ceff ww) seb C. 


Wy Wo We Wo 
Wa Wy] Wo We 


Figs. 8 and 9 show the frequency characteristics of Ba and Ga re- 
spectively for various values of a parameter Co/C; . 
From (40), we obtain the following relation: 
2 2 
core | Gort | er gay. (41) 
4 Wi We 
For a high-gain amplifier, the following condition should be satisfied 
from (20): 


Gro = (1 + Le) Gar % For | Cott ’Z1 (42) 
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where Lz is an external idler loading factor of the amplifier.* Substituting 
(41) into (42), the high-gain condition is expressed as follows: 
1 Wer” GauZ1 , 


= ~ 1. 2 
2 wwe 1 + Le | (4 ) 
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Fig. 8 — Frequency characteristics of an equivalent shunt susceptance of a 
diode. 


From (385), a noise figure expression for a high-gain amplifier is approxi- 
mately given by the following equation: 


Pes 1 + GaZ)) (1 =e “) (35”) 
We 


Eliminating GZ, from (35”) by using (42’), and differentiating (35”) 
* Originally introduced by M. Uenohara. 


UNILATERAL PARAMETRIC AMPLIFIER 1139 


with respect to w./w,, we obtain the optimum ratio of idler-to-signal 
frequency for the optimum noise figure, and the optimum noise figure 
for a given signal frequency f; , a diode critical frequency f., and an idler 
loading factor Le 


Foot = (1 += 20 + ay rn { + 5 Vi + S| (43) 


Wer 


— pe 


(2) W1 Qi (44) 
lice N/R) 9/2) 


Fig. 10 shows the numerical values of (F’)op, and (we/w1)opt - 

It is shown in (43) that the optimum noise figure is fairly good but 
slightly higher* than that of a single-diode parametric amplifier. The 
reason for the higher noise figure is attributed to the fact that, to obtain 
the same amount of gain with a given ratio of reactance swing, the 
quarter-wave coupled amplifier needs a lower idler frequency than that 
of a single-diode reflection-type amplifier. However, for reflection-type 
amplifiers it may not be possible to use the optimum idler frequency in 
order to obtain a wide bandwidth, and a small difference in thermal noise 
 * For a single-diode parametric amplifier, the optimum noise figure and the 


optimum ratio of idler-to-signal frequency are obtained in terms of a critical fre- 
quency and idler loading factor, as follows: 


Pops = (4/1 7 (2) (1 + Ly) — vIFE| 


Wer 


LZ | a ere 
~ (4/14 arg) 
Q 
(2) WC aa eee | 
1 J opt ear 


1 


aa 4/4} the, Vit 
Qr Q1 











If Q: > 1, these equations yield 


1 pe 
Pac ( +> VJS1+ ts) 
Q1 


We Pe) O1 
a NS > .. 
1 fF opt V1 + Ly 


These are the same shown in (43) and (44) if Q: is replaced by 3/2. 
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Fig. 9 — Frequency characteristic of an equivalent shunt conductance of a 
diode. 


becomes negligible in comparison with the noise contribution from 
antenna circuit losses when the amplifier is refrigerated at a very low 
temperature; therefore, the disadvantage of slightly higher noise per- 
formance of the unilateral amplifier seems to be not always a scrious 
problem in practical applications. 


VI. DESIGN CONSIDERATION OF THE AMPLIFIER 


In previous sections, we discussed the amplifier characteristics in 
general terms. Now we have to consider what kind of diode is necessary 
and what image impedance we have to choose in order to obtain a pre- 
scribed gain at a prescribed impedance of the system. And also we have 
to consider what characteristics the amplifier should have without pump- 
ing in order to have gain with pumping. . 

From the equivalent circuit of the diode shown in Fig. 7, the equivalent 
change in shunt susceptance AB due to a static change in the junction 
capacitance AC; is given as follows: 

ABH oe, = QGa ~ 7 : (45) 
If the static change in the junction capacitance AC’; equals the amplitude 
of sinusoidal variation, |c|, (45) yields the resultant change in the 
equivalent susceptance as follows: 


AB = 2(wer/w) Ga . (46) 
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Multiplying (46) by Z, and substituting it into (42’), we obtain the 
following equation: 


GY SB Ae EB). (47) 
Wer 


It is seen in (47) that in order to obtain a high-gain amplifier, when the 
diode bias is varied over the entire range for making a passive test of the 
amplifier, the susceptance change normalized by the image impedance 
Z, at the signal frequency should be twice as much as the amount given 
in the right-hand side of (47). Also, the amplifier network should be a 
piece of matched transmission line if it is not pumped, and the two diodes 
in the amplifier should have the opposite direction of change in suscept- 
ance at the signal frequency when the bias voltage is changed in the 
same direction. If we have external idler ports for loading the amplifier, 
the two diodes should have the same direction of change in susceptance 
at the idler frequency. In Fig. 11 are shown required amounts of suscept- 
ance, which are twice the amount shown in (47), to fulfill the condition 
of unilateral amplification, as functions of idler to critical frequency ratio 
for various idler loading factors. 

The bandwidth problem of this amplifier is not so straightforward as 
the noise figure problem. As mentioned in (21) of Section III, the ap- 
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Fig. 10 — Optimum noise figure of the unilateral parametric amplifier. 
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Fig. 11 — Required susceptance change at the signal frequency for high-gain 
amplification. 


proximate bandwidth is determined by the idler circuit that can be im- 
proved by a multiple-tuning technique. However, in order to utilize a 
broadband idler circuit, the signal network should have a broad enough 
bandwidth to prevent any oscillations. The reason for this is as follows: 
in order to obtain a unilateral amplification a larger capacitance swing 
is needed than for a reflection-type amplification. Thus, wherever im- 
pedance conditions outside the signal frequency band become favorable 
for ordinary reflection-type amplification, the amplifier tends to change 
its mode of operation and breaks into oscillation. For this reason, not 
only must the image impedance of the network at the signal frequency be 
flat, but the phase constant also must be less frequency-sensitive in the 
frequencies where the amplifier has a gain. [I’or a closer investigation of 
the stability problem, we have to study (9).] 
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If both signal and idler frequencies are far from the self-resonant fre- 
quency of the diode, an unloaded Q of the diode at a given angular fre- 
quency w, which is approximately given by 


@& OBa 


Qui © 2G, Qo.” 


(48) 
is an important factor in determining the bandwidth of the amplifier. 
This factor is expressed by the following equation: 


2 
Qua rs +4 He (2 8), (48’) 
2 7 0 \WO w 
Fig. 12 shows the frequency dependence of Q.a for various values of 
C)/C;. In Fig. 12, it is shown that the idler frequency should be close 
to the self-resonant frequency in order to obtain a large bandwidth. If 


the idler circuit is a single-tuned circuit with a loaded Q of Q12 , the gain- 
bandwidth product is roughly given by 


) rant 2 
(2 38db VPG ~ fi Qie ; ay) 





6.1 Numerical example 


Suppose the signal and idler frequencies and diode parameters are 
given as follows: 


Rs 
AS 

















Fig. 12 — Frequency characteristic of a normalized unloaded Q of a diode. 
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fi = 4 ge 
fe = 8 ge 
C; = 0.65 pf 
DL, = 0.6 nh 
Cy = 0.15 pf 
fer = 30 ge 
From (38), 
1 


fo = SENT. = 8.07 ge. 


Therefore fo is very close to fr . 


From (37), 
30 ge = = ; ma i 
“ R, = 2.05 ohms. 
From (39), 
Qo = 14.8. 
From the values of Co and C;, 
C)o/C; = 0.231. 
From the values of fi, fe and fo, 
fi/fo = 0.496 
fe/fo = 0.992 


fo/fo = (fr + fr) /fo = 1.488. 
From Figs. 8 and 9, 


R,QBa = 0.80 © Ba = 26.3 m mhos 
R,QoBap = —().85 .. Bap = —28.0 m mhos 
Rio Gui = 0.46 ate Ga = 102m mhos 


3.20 m mhos, 


R.Qc Gap 1.44 ‘are Gap 
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where the subscript p denotes a quantity for the pumping frequency. 
From (46), 

AB, = 15.3 m mhos. 
Assuming LZ, = 0, the right-hand side of (47) yields 
A(w2/wer) = 1.07. 


Therefore, in order to have a good gain, the following condition should 
be held 


Z,AB, > 1.07 
*. Z, > 70 ohms. 


Suppose we use 80 ohms for Z, , which satisfies the above condition. The 
expected noise figure at room temperature is obtained from (35”) as 
follows: 


F = 1.081 XK 1.49 = 1.61 = 2.1 db. 


If the amplifier and isolator are both refrigerated at 78° Kelvin, the 
noise temperature defined by 


T, = (F — 1) X 290°K 
becomes 
T, = 48°K 


If the amplifier is further refrigerated at 42°K, the amplifier noise tem- 
perature is 

T, = 2.6°K. 
If the input mismatching VSWR normalized to Z, is 1.2, the noise 
temperature of the amplifier increases by 1° Kelvin even with the 
isolator refrigerated at 78° Kelvin [ef. (35’)]. 


When we use a single-tuned idler circuit without external loading, 
Qin & Qo, the percentage gain bandwidth product obtained from (49) is 


(Af/fi)sav PG & 27 per cent. 
If we choose 16 db gain, the bandwidth is calculated as 
(Af/fi)san & 4 per cent or (Af)san & 160 me. 


With a double-tuning technique, the bandwidth may be improved up 
to 300 me. 
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VII. CONCLUSION 


Performance characteristics of a unilateral parametric amplifier with 
two diodes separated by a quarter wavelength at the signal frequency 
have been theoretically investigated on the basis of a scattering matrix 
representation. It is shown that a broadband signal circuit is essential 
in order to obtain a unilateral gain; otherwise, the amplifier may easily 
oscillate. The reason for this is that a unilateral amplifier requires a 
larger capacitance swing than an ordinary bilateral amplifier. It is shown 
that it is possible in principle to obtain any amount of low noise ampli- 
fication without adjusting an input and output matching network of 
this amplifier, though the optimum noise figure is slightly higher than 
that of an ordinary reflection-type amplifier. This higher optimum noise 
figure does not always present a serious problem for practical applica- 
tions, because the idler frequency is often determined by other factors 
such as broadbanding or pump availability. The bandwidth is primarily 
determined by the idler circuit. Broadbanding by introducing a mismatch 
in the signal circuit is not practical because it deteriorates the unilateral 
characteristic. A numerical example is given to show the potentiality of 
building an amplifier at 4 ge which has a 2.1 db noise figure and more 
than 300 me bandwidth at 16 db gain. Since this amplifier does not have 
substantial reverse loss, it usually requires an isolator at its output port. 
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APPENDIX 


Substituting the following relations 


6; = (2m 4+ 1) 5 + 8 
0. = nr + 62 (50) 
6, = (2m + 2n + 1) 5 + 6 


into (10), and assuming 6’s are small quantities, each coefficient of the 
scattering matrix is modified as follows: 


A = 1 — Qaywon” — j2wywox”(Qaonk’bo + bp) (51) 
Sy: A = Soy A = J2aek" (61 + ZewyeoK"b. + 5p) 
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Sy: A = (=) Yih —- Qurwor” aa {il a Quwgk’ ) 
+ Quwox’ (be + by) i] 


Sor A = (—)" GIL + Qaywor” — 7{8)(1 -- 2a’) 
mies Deoyvor’ (62 — 5p) }] 
Si3-A = — (;/o2)Se-A = — wik exp (JOp0/2) 
{51 + 62(4onwex — 1) + ép} 
Sud = — (on/o2)S3o-A = (—)"wrx exp (78p0/2) (81 + &2 + 8p) (52) 
ete., where 


Ono = (2m + 2n + 1)(2/2). 


Equation (52) shows that the unilateral characteristic is deteriorated 
by the imperfect phase synchronization. And the amount of coupling 
between the signal wave in the reverse direction and the other three 
waves is determined by 6, + 62 + 6, for a high-gain amplification where 
Quik" & 1. An imperfect pump synchronization represented by 6, shifts 
the center of the amplification band as shown in (51). It is also shown in 
(51) that the bandwidth is affected mostly by a frequency-dependent 
idler phase constant if x is kept constant. 
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The Use of Wollaston Prisms for a 
High-Capacity Digital Light Deflector 


By W. J. TABOR 
Manuscript received March 23, 1964 


A digital light deflector was recently proposed by T. J. Nelson! in 
which n optical modulators and 7 uniaxial crystals were used to provide 
2” positions of the beam. Each uniaxial crystal was used to deflect a 
beam of light into either of two beams, with the light at the output 
remaining parallel to the input but displaced by an amount proportional 
to the thickness of the crysta]. For a large number of positions it was 
found that a lens had to be employed in order to focus the beam into a 
small spot. The use of the lens requires that converging or diverging 
light must pass through the uniaxial crystal. In this brief we point out 
that this converging beam, when passing through the crystal as an 
extraordinary ray, is subjected to an index of refraction which varies 
rapidly with angle. The digital light deflector using this type of deflection 
therefore has appreciable image distortion, and the limiting spot densities 
that can be achieved are less than would be predicted if only diffraction 
were important. 

A system which has less image distortion than the above can be con- 
structed by using Wollaston prisms? (similarly Rochon or Senarmont 
prisms) and parallel light. (See Fig. 1.) Only the first two prisms are 
shown in Fig. 1. A Wollaston prism has the property that a collimated 
beam incident to its first face will be deviated, depending on its polariza- 
tion, into either of two collimated beams. In this case the output beams 
will have an angular separation, whereas in the Nelson proposal the 
two output beams were laterally displaced without a change in angle. 
In general, » prisms will be used to provide 2” resolvable angles, and 
these angles can be displayed as 2” focused spots of light by means of 
a lens. This system has several advantages: (1) the parallel bundles of 
light contain only the angular variation implied by diffraction theory, 
which can be made small; (2) the light rays are always either nearly 
parallel or perpendicular to the optic axis where the index varies only 
slowly with angle and (8) the Wollaston prism contains much less mate- 
rial than the equivalent blocks of uniaxial crystal. 
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Fig. 1 — Use of Wollaston prisms in a digital light deflector. 


One disadvantage of the Wollaston prism is that the deviation angle is 
not constant as the incident angle of the parallel bundle of light is varied 
from the perpendicular direction. This may necessitate placing the Wol- 
laston with the smallest deviation first, the next largest second, etc. With 
this arrangement no Wollaston prism will have an incident angle differing 
from the perpendicular direction by amounts as large as the deviation 
angle of that prism. In this way the problem of the varying deviation 
angle should be minimized. 
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